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Foreword 


The present collective volume is conceived of as the ideal continuation of my mono- 
graph Disital Papyrolosy, which indeed appeared as Volume I with the same pub- 
lisher. The two volumes are part of a project initially named Beyond the Apparatus 
intended to frame past and current issues surrounding the digital tools and methods 
that are being applied to papyrological research and scholarship. In the monograph, 
I tried to sketch the general outlines of electronic resources (bibliographies and 
bibliographical standards, metadata catalogues, virtual corpora, word lists and 
indexes, digital imaging processes, digital palaeography, information media, quan- 
titative analyses, integrated workspaces, textual databases) in an attempt to define 
Digital Papyrology as a self-standing discipline that deals with meta-papyri, i.e. 
papyrus texts in the digital space. Accordingly, I argued that the ultimate purpose of 
Digital Papyrology is the digital critical edition of papyrus texts. The goal of the 
present volume is precisely to investigate this purpose, from the multifaceted view- 
points of the most advanced trends and projects in the field: namely, the deploy- 
ment of platforms suitable for the encoding of proper digital critical editions of both 
documentary and literary Greek papyri and the development of quantitative analy- 
sis methods for the evaluation of the linguistic features of the texts. 

In this challenge, I owe gratitude to my international colleagues and friends 
who have enthusiastically accepted to contribute with their invaluable experience 
in the field: in a rigorous alphabetical order, Rodney Ast (Heidelberg), one of the 
leaders of the Digital Corpus of Literary Papyri (whom I wish to thank for a linguistic 
revision of this Preface); Lajos Berkes (Berlin), member of the Papyri.info editorial 
board and author of several born-digital editions of documentary papyri; Isabella 
Bonati (North-West University, Pochetsfroom, South Africa), soul of the lexico- 
graphical project Medicalia Online; Giuseppe Celano (Leipzig), co-editor of The An- 
cient Greek and Latin Dependency Treebank, with his long-standing experience in 
treebanking and morphological annotation of classical texts; Holger Essler (Wiirz- 
burg), DCLP partner and architect of digital projects about linguistic annotation (the 
Annotated Philodemus), image alignment and automated character recognition in 
the Herculaneum papyri (Anagnosis); Massimo Magnani (Parma), who kindly 
agreed to bring a brilliant classical philologist’s viewpoint to the evaluation of the 
issue at stake; Joanne Stolk (Ghent), co-editor of the Trismegistos database of Text 
Irregularities, with her strong experience in linguistic variation in the papyri and its 
digital treatment; Marja Vierros (Helsinki), who launched (and manages) the path- 
breaking platform Sematia aimed at facilitating linguistic annotation of the papyri. 

On my side, I wish to acknowledge the fact that the volume stems from the pro- 
ject “Online Humanities Scholarship - A Digital Medical Library of Ancient Texts” 
(DIGMEDTEXT: http://www.papirologia.unipr.it/ERC), funded by the European 
Research Council (Advanced Grant Agreement no. 339828) at the University of 
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Parma (2014-2016) and directed by Professor Isabella Andorlini, to the grateful 
memory of whom this volume is dedicated. This statement is not a matter of pure 
bureaucracy. The DIGMEDTEXT project, primarily aimed at creating a database of 
the Greek medical texts on papyrus, has been the breeding ground for more general 
— theoretical, methodological, and technical — reflections about linguistic papyro- 
logical phenomena and their electronic treatment, as well as about the digital criti- 
cal edition of the papyri themselves. It is my hope that the entire papyrological 
community, and in general all scholarship interested in such topics, will enjoy the 
results reached in the past years, and that discussion and development may contin- 
ue further in the future. 


Parma, January 10, 2018 Nicola Reggiani 
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Part 1: Platforms Between Theory and Practice 


Nicola Reggiani 
The Corpus of the Greek Medical Papyri and a 
New Concept of Digital Critical Edition 


1 Defining and shaping a digital critical edition 


Traditionally and basically, a critical edition of a text is the printed output of a philo- 
logical work, i.e. the process of reconstruction of a textual archetype (the ‘source’) 
among different variants, aimed at reproducing the original text as most exactly as 
possible, or, in other terms, as the fixed representation of a scholar’s more or less 
trustable opinion on that text. Accordingly, and rather intuitively, a digital critical edi- 
tion should be defined as the digital output of a philological work. We will see what 
a “digital output” involves in methodological and epistemological terms but, to start, 
it must be noted that traditionally a digital critical edition is regarded as the digital 
transfer of a printed critical edition. Sometimes, this process regretfully gets rid of the 
attribute ‘critical’, so that we have digital editions or textual corpora deprived of ap- 
paratus criticus and therefore ‘uncritical’, as in the well-known cases of the Thesaurus 
Linguae Graecae or of the Perseus Digital Library. This treatment presents encoding 
advantages, since one reference edition is chosen and digitized, but also huge disad- 
vantages in terms of usability, because search and analysis functions are limited to 
the chosen text, without consideration, e.g., for textual variants, alternatives or dif- 
ferent editorial solutions.’ Somewhat ‘hybrid’ editions try to save the constitutio textus 
(the restitution of a text as close as possible to the supposed original) alongside the 
recording of variant readings: for example, the former Duke Databank of Documentary 
Papyri with the spelling variants (as written on the original papyrus) embedded 
within the ‘normalized’ text with special markup.’ A fairer transfer process preserves 
the apparatus criticus, which is usually displayed in a way that resembles the printed 
edition. The simplest examples are PDF editions (either scans of paper samples or 
born-digital files like the publications of the PHerc project)? the most articulated ones 
are the digital editions available at the Papyri.info platform, where critical annota- 
tions, encoded as inline XML markup elements, are processed and displayed in an 


The present contribution is published in the framework of the Project “Online Humanities Scholar- 
ship: A Digital Medical Library Based on Ancient Texts" (DIGMEDTEXT, Principal Investigator Professor 
Isabella Andorlini), funded by the European Research Council (Advanced Grant no. 339828) at the 
University of Parma (http://www.papirologia.unipr.it/ERC). 


1 Seealready DEGANI 1992, and more recently MAGNANI 2008, 135-7; also M. Magnani in this volume. 
2 Cf. REGGIANI 2017, 215-7. 
3 Cf. REGGIANI 2017, 176. 
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apparatus of print-like format, stressing the distance between the ‘correct’ text and 
alternatives, variants, actual textual features. 
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Fig. 1: PSI XV 1510, medical catechism on anatomy, III cent. AD: printed edition and digital edition at 
http://litpap.info/dclp/64024. 


This traditional view is being challenged as rather uncomfortable by the development 
of digital technologies in the ancient studies, as well as by an increasing concern for 
the actual testimonies and the process of textual tradition: we may define it as a sort 
of *phenomenological" approach. Digital projects like the Homer Multitext Project 
(HMT) or the Leipzig Open Fragmentary Texts Series (LOFTS) started envisaging a dif- 
ferent approach to textual criticism, in deploying a text that is in fact a multitext, a 
fluid and dynamic network of multiple editions aligned to each other (by means of a 
URN architecture) rather than a traditional fixed structure of text and apparatus crit- 
icus,^ In this framework, the uneasiness of texts that are felt not being completely 
suitable for a 'traditional' critical edition (e.g. oral Homeric poetry,? fragmentary 


4 Amultitext is basically a dynamic collection of multiple critical editions, a network of versions with 
a single root. As Monica Berti described it, “[i]t produces a representation and visualization of textual 
transmission completely different from print conventions, where the text that is reconstructed by the 
editor is separated from the critical apparatus that is printed at the bottom of the page. [... It] allows 
the reader to have a dynamic visualization of the textual tradition and to perceive the different chan- 
nels of both the transmission and philological production of the text that is usually hidden in the 
static, concise, and necessarily selective critical apparatuses of standard printed editions. Producing 
amultitext, therefore, means producing multiple versions of the same text, which are the representa- 
tion of the different steps of its transmission and reconstruction, from manuscript variants to philo- 
logical conjectures" (BERTI forthcoming, 4). Cf. REGGIANI 2017, 266 ff. 

5 The HMT project concept results from the statement that the Homeric textual evidence does not 
comply with the traditional philological view of textual variants stemming from one archetype, since 
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sources?) merges with the new capabilities of digital infrastructures, which offer 
much more dimensions than printed paper. Hypertext is a new writing space, to 
which editors have to adapt the texts:" 


[o]nce we are able to overcome the physical limits of printed editions by joining together variants 
and conjectures referring to the same texts, it also becomes possible to look at the texts from a 
new and broader perspective, with possible consequences for our knowledge and comprehen- 
sion of them.® 


Thence, an unavoidable fact: 


[w]e need to move in the direction of digitally conceived and initiated types of information and 
away from mopping up information from print sources.? 


Asit has been put very effectively, the hypertext architecture is challenging the Urtext 
model,” and it paves the way for exploring the possibilities of “holistic” models 
where editorial choices are superseded by an interactive network of all extant data, 
with potentially infinite information layers.” Perhaps, the model that better describes 
this ideal condition is an ontology design: 


an ontology is the most suitable solution to represent critical editions of ancient texts for two 
main reasons: first, we want to be able to link different kinds of resources [...] that have in com- 
mon the possibility of being referred to via URIs, which is one of the principles of the Semantic 
Web; second, information contained in critical editions constitutes a layer of interpretation and 
a description of relations about texts that is important to keep clearly distinct from the texts 
themselves. Indeed, the use of stand-off metadata encoded within ontology allows us to express 
an open-ended number of interpretations, whereas a markup-based solution would not make 
this possible due to obvious reasons of overlapping hierarchies.” 


a true original Homeric text never existed (cf. BIRD 2010): a somehow “agnostic” (BODARD — GARCÉS 
2009, 96 n. 31) environment where all witnesses are transcribed and juxtaposed, without preference 
for any of them. See M. Magnani's chapter in the present volume for a critical view of this idea. 

6 Ancient fragments are characterized by a high level of textual complexity, in the relationships 
among the actual text in which they are embedded, its critical edition (interpretation), the original 
source (attribution), the quoting source (witness), etc.: cf. BERTI forthcoming. 

7 Cf. BOLTER 1991; REGGIANI 2017, 263 ff. 

8 ROMANELLO — BERTI — BOSCHETTI — BABEU — CRANE 2009, 165 

9 BAGNALL — GAGOS 2007, 74. 

10 BOLTER 1991. 

11 Cf. BODARD — GARCÉS 2009 

12 ROMANELLO — BERTI — BOSCHETTI — BABEU — CRANE 2009, 158. An ontology is a formal definition of 
types, properties, and interrelationships of the entities belonging to a certain domain of knowledge. 
In other words, it compartmentalizes the variables needed for some set of computations and estab- 
lishes the relationships between them. 
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Fig. 2: A sample ontology model (from ROMANELLO — BERTI — BOSCHETTI — BABEU — CRANE 2009, 167). 


2 Papyrology: philology in flux 


Papyrology is, in its more essential core, all about providing trustable critical editions 
(and commentaries) of papyrus texts.” Though projected towards a broad historical 
and cultural evaluation of the textual data," it is intimately a philological discipline: 
no one can deny that without texts there would exist no Papyrology. Yet it is a very 
peculiar philological discipline, since it is well aware of the fluidity of its objects of 


13 Cf. YOUTIE 1963, 22-3. 
14 Cf. e.g. BAGNALL 1995. 
15 Cf. HANSON 2002, 196; SCHUBERT 2009, 197. 
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study:! texts are continuously published, updated, collected, revised, corrected, 
emended, republished, and there is hunger for resources that can help handling an 
overwhelming amount of primary data.” It is — to borrow the successful concept that 
Zygmunt Bauman launched to emphasize the fact of change in the modern times® — 
a ‘liquid’ philology, for which digital environments seem extremely fitting; in partic- 
ular, collaborative platforms like SoSOL seem the most suitable incarnation of this 
complex and fluid editorial workflow.” 

Moreover, Papyrology has always been facing an adventurous textual situation, 
having to cope with fragmentary and unique texts and idiosyncratic utterances, and 
has developed a remarkable interest in the scribal and material phenomenology of 
textual features and transmission, which affects consistency in treating the wide se- 
ries of textual fluctuations occurring in the papyri. Indeed, while philological analy- 
sis would gladly treat fluctuations as deviations from a standard archetype (i.e. mis- 
takes or, more gently, variants) and normalize them in a reconstructed critical 
edition, they actually bear significant socio-cultural relevance and are of fundamen- 
tal importance from the viewpoint of the phenomenology of the papyrus texts, its in- 
terpretation, and ancient writing culture in general. In other words, very often fluctu- 
ations are not used to reconstruct a text but to investigate relevant socio-cultural 
phenomena. Accordingly, the papyrologists’ behaviour towards such textual flavours 
is twofold, and generates a wide variety of editorial inconsistencies that affect printed 
editions as well as digital databanks. 

As to the latter, the issue at stake is not only critical agreement or scholarly stand- 
ards, but also (as hinted above) the usability of the tools themselves, in terms of 
searching and encoding. The best example, from my own experience, is the case of 
the word épynveia, which often occurs in the papyri in the iotacistic form Eppynvia. 
The spelling 'variant' is treated differently in the printed editions, being sometimes 
'regularized' in the apparatus, sometimes not, generating textual inconsistencies 
even within the very same text.? In BGU I 326, ii 15 épynvia is printed without appa- 
ratus notes, and it is reproduced in the databank as such; in the same text, at 1. i 1 the 
same word is supplied as épunvei]a (following the ‘standard’ form) in the database, 
while all the printed editions (after the ed.pr.: Chr.M. 316; Sel.Pap. I 85; FIRA? III 50; 
Jur.Pap. 25) keep the ‘variant’ in the lacuna too. Another ‘classical’ case is that of the 


16 Cf. YOUTIE 1963, 27-32; HANSON 2002, passim; SCHUBERT 2009, 212-3. 

17 As I pinpointed in REGGIANI 2017, 2-6, this is the basic raison d’étre of Digital Papyrology. 

18 Cf. e.g. BAUMAN 2000; 2007; 2011. 

19 Onthe collaborative structure of the database cf. REGGIANI 2017, 232 ff. All editorial interventions 
are kept recorded in a “History” log, which is available to every user: see L. Berkes’ article in this same 
volume for a screenshot of a sample editorial history on Papyri.info. 

20 Cf. REGGIANI 2018a. Some remarks on the inconsistent treatment of iotacism can be found also in 
J. Stolk’s and M. Vierros’ contribution to this volume. 
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verbal forms of yiyvopoi, which becomes y(e)ivopat in the Koine Greek.” The latter 
forms are indeed treated as the standard in most of the papyrus editions, and there- 
fore are not ‘regularized’ as variants,” but this is not always consistent: editorial reg- 
ularizations do occur, seemingly only when the verb is affected also by iotacism, often 
in compounds.” On the other hand, we do find the classical Greek forms not being 
regularized as well," which increases the uneasiness of anyone who would like to 
perform effective searches in the digital textual corpora. With the further develop- 
ments of the Greek language, the situation is even more complex: for example, the 
general shift from dative to genitive in the later (Byzantine) instances of the language 
of the papyri? leads to further editorial inconsistencies. In BGU XIII 2332,20 (AD 375), 
for instance, Urtápyo + genitive (pov) is regularized in dative (pot) according to the 
classical use, whereas in SB XVIII 13947,15 (AD 507) bmdpyw + dative (por) is regu- 
larized in genitive (pov) as if the latter was then the correct form.” One must be aware 
of any possible spelling or syntactic combination to perform trustable textual 
searches.” 

As is apparent, papyrus texts carry a cognitive complex that is often hard to fit 
into printed editions and may find its better representation in the digital space, where 
the objects of study undergo a process of dematerialization. I have already argued 
that the development of Digital Papyrology, in its treatment of computerized infor- 
mation about papyri, produced the effect of working on the virtual representation 
(avatar) of the papyri themselves, which turn to be meta-texts,” in the terms already 
envisaged by Traianos Gagos as early as 1998: 


In this new era of papyrological research, we cannot speak of a collection of papyri alone, but 
also of a collection of electronic files, data, metadata and digital images:?? 


21 Cf. DEPAUW — STOLK 2015 and J. Stolk's chapter in this volume. 

22 A quick survey of a sample search in Papyri.info can give a global idea of this trend: http:// 
papyri.info/search?STRING1=yetvop&targetl=TEXT&no_caps1=on&no_marks1=on&STRING2=NOT+ 
ytyvo&target2- TEXT&no caps2-on&no marks2-on. 

23 nopoyíe]werat l. mapayiyvetat in BGU XVI 2651,6; yeiveo0at l. ytyveo0at in Chr.M. 1721,15; 
katayetv[o]pfat] l. karoytyvopat in P.Bodl. I 17,i,9; mapayetvopat l. mapayiyvopat in P.Haun. II 22,5; 
neptyeivopévwv l. tepryiyvopévwv in P.Stras. VIII 772 passim. Note the double possible regularization 
yiyveo0ar or yevéo8at advanced for yeiveo8at in P.Col. X 280,13. 

24 Another sample search: http://papyri.info/search?STRING1=ytyvop&target1=TEXT&no_caps1= 
on&no marks1-on. 

25 Cf. STOLK 2015b. 

26 For more similar cases cf. STOLK 2015a, 85 ff., and 2015b. 

27 Cf. DEPAUW — STOLK 2015, 213. See also STOLK 2015a, 93. 

28 On these topics cf. REGGIANI 2018a, 2018b, 2018c, and J. Stolk in this volume. 

29 Cf. REGGIANI 2017, 260 ff. 

30 GAGOS 2001, 516. 
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The availability of huge amounts of information in fully searchable textual form with accompa- 
nying images through these new media is altering drastically the definition of what constitutes 
a ‘text’, the way we experience reading it and, ultimately, the plurality of messages a text can 
offer to one or more readers. The new methods of presenting text with marked up images and the 
simultaneous availability ofa variety of other research tools within the same electronic environ- 
ment give us new ways of visualizing and approaching a given text. An edited text is no more a 
static, isolated object, but a growing and changeable amalgam: the image allows the user to look 
critically at the ‘established’ text and to challenge continuously the authoritative readings and 
interpretation of its first or subsequent editors. 

Furthermore, the simultaneous access to and study of thousands of texts and their images that 
could be as far apart as a millennium, in a single search and through the same medium, has the 
potential to challenge our established notions of the ‘messages’ a text carries within itself, its 
textuality and intertextuality [...]. As Roland Barth [sic] explains: ‘Any text is an intertext; other 
texts are present in it, at varying levels, in more or less recognizable forms: the texts of the pre- 
vious and surrounding cultures. Any text is a new tissue of past citations. Bits of codes, formulae, 
rhythmic models, fragments of social languages, etc. pass into the text and are redistributed 
within it, for there is always language before and around the text’. In one or another way, papy- 
rologists have always recognized the “intertextuality” of the Greek papyri from Egypt, because 
of the multicultural and multi-ethnic environment in which these texts were born. The develop- 
ment of the new electronic media in our field and the capability to establish these cross-links — 
or these intertextual signifiers, so to speak - on the linguistic, cultural and historical level 
through the interaction of multiple texts, images and a variety of related tools places the notions 
of textuality, intertextuality and metatextuality on a new (electronic) platform which, in turn, 
becomes part of these notions as the ‘carrier’, ‘interpreter’ and ‘distributor’ of these texts.” 


The concept that Digital Papyrology redefines the notion of ‘papyrus’ is embedded in 
the consideration that 


these media, when used within a wider intellectual perspective as a cognitive tool for research 
and instruction and not only as a pragmatic medium that can ‘do certain things for us’, can chal- 
lenge and redefine notions of ‘text’ and textuality.” 


After realizing that we are coping with enhanced papyri that are in fact ‘meta-papyri’, 
we need to reshape the digital edition in accordance with the nature of the papyro- 
logical digital data as autonomous intellectual objects (following the definition of 
what is ‘data’ for the humanists according to OWENS 2011),? and the possibilities of- 
fered by the electronic meta-space.™ There is a momentous chance to see the digital 
document not as the mere, more or less complete reproduction of a printed critical 


31 GAGOS 2001, 514-6. 

32 GAGOS 2001, 515 n. 8. 

33 At the same time constructed artefacts, being created by people, and interpretable texts, they “can 
hold the same potential evidentiary value as any other kind of artifacts”. 

34 See also the observations by M. Magnani in this volume. 
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edition, but as a quantum particle of a fluid universe of text transmission. This ‘dis- 
positive’ — in foucaldian terms? — may find a suitable representation through the 
abovementioned ontology design, where we do not have to decide what is ‘regular’ 
or ‘normal’ and what is a ‘secondary’ reading, but can create an interconnected net- 
work of aligned versions, which represent different possible layers of textuality:? 


[o]nly with a comprehensive understanding of the content and assumptions of the traditional 
hughly-evolved critical apparatuses will we make the right strategic decisions for the future of 
textual scholarship.” 


Philology tends to overcome any textual fluctuation in favour of a reconstructed text 
that be as closest as possible to the ‘original’ source, but documentary papyri are actu- 
ally the original source of themselves (any critical interventions being configured as the 
reconstruction of an imaginary archetype), while literary and paraliterary papyri pre- 
sent more complex issues, as introduced below. They are therefore among the best text 
typologies suitable for exploring new ways of conceiving digital critical editions. 


3 The medical papyri: special technical needs of a 
special technical corpus 


Within the framework sketched above, the Digital Corpus of the Greek Medical Papyri 
project? proved pathbreaking in applying the notion of digital edition to literary and 
paraliterary papyri, previously excluded from Papyri.info and object of very specific 
and isolated projects (CPP, THV etc.). The project stemmed from Isabella Andorlini's 
lifelong interest in the medical papyri and from her own challenge to collect them in 


35 LAMÉ2014 describes this idea (with reference to ancient epigraphs) through Foucault's philosoph- 
ical concept of dispositive: the message of the text-bearing object can be completely understood in 
relation with a complex network of many other heterogeneous pieces of information. The ultimate 
purpose is "to digitize also the network that connected those information systems, instead of digitiz- 
ing each individually". 

36 The platform Sematia, discussed by M. Vierros in this volume, is a nice example of how the tran- 
scription of the actual papyrus text can be aligned to a ‘regularized’ layer of the same text, so that any 
possible information is kept in an interactive way. 

37 DAMON 2016, 218. 

38 With DCGMP I refer to the whole digital corpus of the Greek medical papyri as resulted from the 
work of the DIGMEDTEXT project mentioned in the Introduction to this volume. The title Corpus dei 
Papiri Greci di Medicina Online (*Online Corpus of the Greek Medical Papyri") refers to the first stages 
of the project. Bibliography: ANDORLINI — REGGIANI 2012; REGGIANI 2015; 2016a; 2017, 273-5; ANDORLINI 
2017; BERTONAZZI 2018a, 24—9; REGGIANI 2018b; http://www.papirologia.unipr.it/ CPGM. 
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a uniform and homogeneous corpus.” Her first steps went towards the printed me- 
dium,^ but she soon realized the strong potentials of Papyri.info to host dynamic pap- 
yrological editions,^ and her project later became one of the leading pilot test cases 
of the rising Digital Corpus of Literary Papyrology? in envisaging new technical and 
theoretical strategies for the encoding of literary and paraliterary texts, eventually 
awarded with an ERC advanced grant (http://www.papirologia.unipr.it/ERC). 

Medical papyri are technical texts: they have been conceived to convey a tech- 
nical knowledge, i.e. theoretical and practical specialized information at the same 
time — a knowledge that is, in turn, mirrored and refracted in the different written 
genres encompassed by the corpus.? The importance of medical technical skills is 
apparent, and not only for health reasons (think of Galen's instructions to the patients 
so that they can choose the best doctor after an enquiry on his skills): one might 
recall P.Oxy. I 40 (4 BL I 312, V 74, VI 95; Oxyrhynchus, II cent. AD), a copy of the 
report of a court judgement where a public doctor claims for immunity from some 
liturgies, and the judge, after a rather witty remark, requests a scientific proof of his 
assertion.^ The importance of written text for this education is stressed as earlier as 
in the Hippocratic corpus: *I consider the ability to evaluate correctly what has been 
written as an important part of the art" — says the author of the Epidemics — “He who 
has knowledge of it and knows how to use it will not commit, in my opinion, serious 
errors in the professional practice" (Epid. III 16 = III 10,7 ff. L.). In fact, the transmis- 
sion of this knowledge was carefully carried out through a specialized education, 
which was based on oral teachings later entrusted to written supports. In the intro- 
duction to the treatise On his own books, Galen himself explains how in the context of 
the oral lesson one used to take written notes, thence moving to the publication of 
memoranda, the hypomnemata of the lessons heard.“ 

Stemming from both the knowledge of oral teaching and the know-how of prac- 
tical records and individual experience, every medical writing is not a fixed book but 
a tool in flux: the older treatises are annotated, commented, collated often against 
annotated and commented copies," transcribed with additions, corrections, and up- 
dates; the collections of personal notes on clinical cases, therapies or remedies are 


39 Cf. REGGIANI 2018d. 

40 Cf. ANDORLINI 1997a. 

41 Cf. ANDORLINI — REGGIANI 2012, 138-9; BAGNALL 2012, 4. 

42 See the chapter by R. Ast and H. Essler in this volume. 

43 Cf. ANDORLINI 1993; REGGIANI 2018e. 

44 Cf. NUTTON 1990. 

45 On official examinations of physicians see REGGIANI 2018f. 

46 Cf. NUTTON 1972; NIEDDU 1992, 555-7; ANDORLINI 2003, 14. 

47 On the collation of annotated copies, always according to Galen’s words (In Hp. Off. III 22 = XVIIIb 
863,14—865,5 K.; In Hp. Epid. II 8 - XVIIa 634,3-7 K.), cf. ANDORLINI 2003, 15, who recalls (note 15) the 
story of Mnemon, who took the third book of Hippocrates' Epidemics from the library of Alexandria 
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constantly revised on the ground of practice; prescriptions are transcribed, ex- 
changed, collected, gathered in the receptaria and passed down; handbooks of dif- 
ferent typologies are used to teach again, and so on, keeping on the written support 
traces of every stage of transmission and use." 

Such texts are further characterized by intertextual and transtextual connections: 
references, quotations, more or less literal parallels, are another key to understand 
and contextualize the matter at the best. The recipes, and their collections known as 
receptaria, find inspiration in the pharmacological treatises and are further enriched 
by the doctors’ personal practice and by references and quotations from different 
medical sources; the questionnaires are connected to the literary tradition of the Def- 
initiones medicae (see below). These are by no means stemmatological relationships 
between ascendants and descendants: it is a fluid knowledge undergoing continuous 
changes, updates, adaptations, much influenced by oral teaching and actual prac- 
tice. Accordingly, the very textual data interweave with a huge panel of textual de- 
vices, which contribute to articulate an expressive network that is essential to the 
medical writing itself, to its transmission, to its learning, and to its practical use: 
therefore, they deserve a particularly careful consideration. Critical and diacritical 
marks, punctuation, graphical and layout features, technical terms and formulae, lit- 
erary or sub-literary references or echoes, marginal annotations - to cite the most 
outstanding devices — form a complex interplay that cannot be separated from the 
text itself, nor — even more - ignored, without compromising the correct interpreta- 
tion of the evidence. Rigid definitions of philological variants do not really apply, as 
well as the treatment of linguistic variants can be more complex than the simple ap- 
plication of regularization markup tags, which categorize a ‘standard’ (not to say ‘cor- 
rect’) and a ‘deviant’ version of a word." 

The inadequacy of the traditional philological/stemmatological model to repre- 
sent in full the textual features of these complex and fluid technical writings has al- 
ready been pointed out by Ann Hanson,” who advanced an “accretive model of com- 
position” to provide a suitable description of the phenomenon. In David Leith’s 
words, 


[t]he textual tradition of compilations of this sort was highly fluid, and we should not conclude 
that they represent exactly the same text.” 


and brought it back with the marginal addition of marks indicating clinical histories, traced with dark 
ink and big letters, in imitation of the original handwriting. Cf. also BONATI 2016b, 63-4, and see be- 
low. 

48 Cf. ANDORLINI 2003; REGGIANI 2018e and 2018g. 

49 Cf. REGGIANI 2018a for further details, and see below. 

50 HANSON 1997. 

51 D. LEITH, P.Oxy. LXXX 5239. 


The Corpus of the Greek Medical Papyri and a New Concept of Digital Critical Edition — 13 


Digital tools offer now the best solutions to face this challenge, which Isabella An- 
dorlini herself envisioned at the very beginning of the Corpus dei Papiri Greci di Me- 
dicina project, a primary focus of which was 


la nuova attenzione rivolta alle problematiche editoriali dei testi studiati nella complessità dei 
rapporti con le fonti rispetto alla tradizione conosciuta.5 


4 Envisioning digital critical editions of (medical) 
papyri 


As I anticipated above, a digital critical edition can be defined as the digital output of 
a digital philological work. This has a vital outcome in terms of data encoding. In- 
deed, encoding data involves a digital critical workflow that takes the features of the 
original texts (and of the printed editions) to adapt them to the digital medium. It 
requires a thorough philological work, namely a digital philological one, where the 
digital papyrologist is — to paraphrase Youtie’s well-known definition — an artificer of 
data (in the abovementioned, intellectual meaning of ‘data’). Any information taken 
from the text or from previous editions becomes data (or metadata, i.e. ‘data about 
data’); and even when encoding a print-published edition, one should check carefully 
the original text to avoid possible inconsistencies and ambiguities inherited by the 
previous editors, so that the ‘liquid’ editorial flux goes on. 
Moreover, 


[e]lncoding fragments is first of all the result of interpreting them, developing a language appro- 
priate for representing every element of their textual features, thus creating meta—information 
through an accurate and elaborate semantic markup. Editing fragments, therefore, signifies pro- 
ducing meta—editions that are different from printed ones because they consist not only of iso- 
lated quotations but also of pointers to the original contexts from which the fragments have been 
extracted. On a broader level, the goal of a digital edition of fragments is to represent multiple 
transtextual relationships as they are defined in literary criticism [...]. Designing a digital edition 
of fragments also means finding digital paradigms and solutions to express information about 
printed critical editions and their editorial and conventional features. Working on a digital edi- 
tion means converting traditional tools and resources used by scholars such as canonical refer- 
ences, tables of concordances, and indexes into machine actionable contents. 


Therefore, “encoding a text is an interpretive act”™ by itself: on the one hand, the 
encoder (the digital papyrologist) must employ as much criticism and careful discern- 


52 ANDORLINI 1997a, 19 (cf. ibid., 21-2) 
53 BERTI forthcoming, 2. 
54 OWENS 2011. 
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ment as possible in order to give the papyrological object its correct digital represen- 
tation. On the other hand, one must be aware of the fact that the digital medium has 
different requirements than the printed one. While philology is a way of describing a 
text to interpret it as a stable source, a phenomenological approach is a way of repre- 
senting a text in all its components, to describe and understand the underlying se- 
mantics. Therefore, when we choose to overcome the inadequacies of a traditional 
critical edition in favour of the digital multi-space, we must keep in mind the follow- 
ing three fundamental requirements: 

— standardization (adapting to the digital medium means to follow its strict rules); 
— semantic representation (which may differ from the traditional philological rep- 

resentation, as we will be noticing below); 
— usability (in terms of data access, searching and developing options). 


Data and metadata can be encoded and used as different, yet interconnected 
(aligned) information layers. An XML annotation markup seems to be the best en- 
coding strategy, since it has a consolidated background in the TEI/EpiDoc system that 
has already been adapted to the papyrological requirements,” providing a standard- 
ized and standardizing framework, a semantic annotation, and powerful search op- 
tions through XPath and XQuery querying languages.” It also allows for any kind of 
final rendering by means of customizable transformation languages (XSLT). Align- 
ment among layers can be achieved by deploying a CTS URN architecture, which is 
useful to give unique identifiers to each element and to avoid overlapping hierar- 
chies, especially in linguistic annotation. Annotated layers can be stored in a GIT re- 
pository so that open access and collaboration are granted. Some layers already exist 
in the SoSOL infrastructure (metadata, introduction and commentary, translation, 
annotated text); more can be envisioned, for example, on the ground of Gérard Ge- 
nette's textual theory, which describes all possible relations among texts and which 
has already been claimed as the privileged interlocutor of the complex textual 'dis- 
positive’ of papyrus texts.” 


55 This is indeed a key issue in Digital Papyrology (cf. REGGIANI 2017, passim) as felt by the very first 
‘fathers’ of the papyrological databases (cf. TOMSIN 1970, 476). 

56 Istarted envisaging this strategy for the medical corpus in REGGIANI 2015, where I sketched some 
possible annotation layers (the article stems from a conference paper delivered in 2012, at the very 
beginnings of the DCGMP project). I revised my argument in REGGIANI 2016a. 

57 Seethe overview discussed by J. Stolk in this volume. In the following pages, I will be referring to 
the online Leiden+ guidelines at http://papyri.info/docs/leiden plus. 

58 See the query cases mentioned by M. Vierros and G. Celano in this volume. 

59 “This ‘intertextuality’ of the text is what G. Genette would call ‘transtextuality’. It is not, perhaps, 
accidental that postmodern theories on language and 'text' developed more or less at the same time 
with the spread of the electronic media" (GAGOS 2001, 515 n. 8). Cf. GENETTE 1992. I outline the possible 
exploitation of Genette's textual theory in relation with the complex textuality of Greek medical pa- 
pyri in REGGIANI 2018c and 2018e. 
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Without any presumption of emulating Herbert C. Youtie, who provided the 
standard outline of a ‘canonical’ papyrus edition, what follows is an attempt of sys- 
tematizing the extant strategies for encoding a digital papyrus edition, with some sug- 
gestions for possible further improvements. The past work on the medical papyri pro- 
vided the most complex and intriguing cases, but the same recommendations can 
apply to simpler cases too, as well as to documentary papyri of any sort. 


4.1 Metadata and bibliography 


Papyrus metadata (i.e. contextual information about texts: chronology, provenance, 
etc.) are currently stored in digital catalogues like the Heidelberger Gesamtverzeichnis 
(HGV) for the ‘documentary’ texts, the Leuven Database of Ancient Books (LDAB) and 
the Mertens-Pack? (M-P?) for the ‘literary’ ones, Trismegistos (TM) for both, and some 
more specialized ones like the Corpus of Paraliterary Papyri (CPP) or Synallagma. Sim- 
ilarly, digital bibliographical repositories exist, namely the Bibliographie Papy- 
rologique (BP) and Trismegistos bibliographies.“ Papyri.info and the DCLP currently 
include metadata from (respectively) HGV + TM and from LDAB,* and point to BP 
records as well as to some more resources (e.g. Synallagma). A digital critical edition 
of medical papyri should of course extend this feature to include the Mertens-Pack? 
(in its specific Medici et medica section)? and possibly to envision some digital ver- 
sion of Marganne’s and Andorlini’s printed catalogues of medical papyri.“ Contextu- 
alization is indeed fundamental: 


[l]o studio del manufatto e una sua corretta collocazione cronologica sono informazioni essen- 
ziali, che possono interferire con le ipotesi di attribuzione dei contenuti, sia per il rapporto con 
gli autori noti, sia per un’adeguata impostazione dell’indagine sulle fonti e sugli anelli della tra- 
dizione indiretta. La provenienza del reperto papiraceo può, nei casi in cui gli elementi archeo- 
logici siano conosciuti, conservare dati preziosi sul contesto in cui inserire le farine di produ- 
zione libraria antica, e sui livelli della sua divulgazione in Egitto (centri di diffusione legati alle 
vie dell’insegnamento e della pratica della disciplina; biblioteche templari, scuole mediche spe- 
cializzate): l’attenzione ai luoghi accertabili di ritrovamento dei reperti ci permette di delineare 
il milieu culturale in cui libri di questo genere furono prodotti, o semplicemente letti, da fruitori 
professionisti e da gente colta con qualche interesse per i temi della salute. 


60 YOUTIE 1963, 22-3. 

61 On these resources cf. REGGIANI 2017, 39 ff. (catalogues) and 14 ff. (bibliographies) respectively. 
62 See R. Ast and H. Essler in this volume. 

63 Cf. MARGANNE — MERTENS 1997 and the online resources cited in REGGIANI 2017, 34. 

64 MARGANNE 1981a; ANDORLINI 1993. 

65 See also M. Vierros’ remarks about metadata of documentary papyri in her article for this volume. 
66 ANDORLINI 1997a, 21. 
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Moreover, the standardizing potential of digital metadata‘ would be a nice ground to 
deal with the problem of the definition of textual genres or typologies, and to face the 
challenge of a categorization, an issue that is well framed — from the medical view- 
point, in the wake of Isabella Andorlini — by Francesca Bertonazzi in the following 
words. 


Classificare i papiri per tipologia non è solo un mero esercizio erudito o, peggio, sterilmente ma- 
tematico nel senso deteriore del termine: al contrario si configura come un’indagine che può 
gettare luce sul contesto di composizione e d’uso del testo, e non di rado può agevolare la sua 
ricostruzione filologica e l’interpretazione esegetica. L’attività non è priva di rischi: un primo 
problema è sottolineato da quanti mettono in guardia dalla rilevanza statistica dei dati che pos- 
sono essere desunti dai papiri, che sono inevitabilmente vincolati ai ritrovamenti, al tipo di de- 
scrizione fornita dal primo editore, dal tipo di classificazione operata nei primi studi sul testo. Il 
primo ostacolo, per così dire, è dunque di natura extratestuale, ovvero risiede nella mera quan- 
tità di papiri appartenenti a una data tipologia: anche se la maggior parte dei papiri medici affe- 
rissero al genere, e.g., del trattato, non per questo si dovrebbe concludere che il trattato fosse il 
genere più praticato in ambito medico nell’Egitto greco-romano. Un secondo problema, di tipo 
intratestuale, risiede nella tipologia stessa del documento, che spesso non appartiene in modo 
netto all’uno o all’altro tipo di testo: “Chi si è occupato anche solo marginalmente della inter- 
pretazione di frammenti di papiro a contenuto ‘medico’, avrà constatato come una delle diffi- 
coltà più evidenti è quella del riconoscimento e della definizione del genere testuale, del tipo di 
opera cui appartennero brani parziali di scritti oggi in larga parte perduti. Una difficoltà dovuta, 
oltre che alla casualità e alla precarietà del reperto papiraceo, anche alla organizzazione stessa 
delle opere a contenuto medico, teorico o specialistico che fosse: il riconoscimento di soggetti e 
termini medici è da solo insufficiente per dirci qualcosa di più preciso sull’impostazione 
dell’opera originaria, in quanto le singole nozioni tecniche ricorrevano in settori diversi della 
disciplina, e potevano essere esposte o discusse a livelli di approfondimento e di concettualiz- 
zazione anche molto distanti tra loro” [ANDORLINI 1997b, 159]. 

In quest'ottica, lo studio del corpus offre alcuni casi interessanti di testi a mezzo tra l'una o l’altra 
tipologia (come P.Oxy. 2.234 + 52.3654,92 tra il catechismo e la raccolta di prescrizioni), oppure 
di informazioni testuali insufficienti a distinguere con precisione l'appartenenza tipologica 
(come in P.Oxy. 74.4973: il testo potrebbe riguardare la veterinaria come la fisiognomica), o an- 
cora di testi che pur rientrando nella categoria ‘lettera’, possono avere natura documentaria 
(come MPER 13.6 e GMP 2.10, lettere redatte da medici, e P.Mert. 1.12, lettera a un medico) oppure 
letteraria (P.Oxy. 9.1184 raccoglie varie lettere di Ippocrate). 

Un terzo problema, di ordine linguistico, risiede nella terminologia moderna utilizzata per clas- 
sificare i testi: non di rado si è avvertita la necessità di puntualizzare le varie accezioni di ‘eti- 
chette linguistiche’ attribuite a generi antichi: *[n]el classificare la ricettazione nei papiri ho vo- 
lutamente differenziato l’uso del termine ‘prescrizione’ (applicato a medicine complete di 
indicazione terapeutica, norme estese alla preparazione e all’uso dei rimedi), da quello di ‘ri- 
cetta’ (applicato a formule assai semplificate, limitate all'indicazione dei componenti, attestate 
anche singolarmente su foglietti di papiro ed ostraca). Con ‘prescrizione’ e ‘ricetta’ identifico 
perciò tipologie leggermente differenti di testi. Definisco col termine ‘ricettario’ un testo poco 
elaborato formalmente, che raccoglie ricette o prescrizioni; con ‘manuale terapeutico’ intendo 


67 On standardization in papyrological metadata see REGGIANI 2017, 74-8. 
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uno scritto in cui si riconosce un'organizzazione compositiva e formale piü complessa, ora pro- 
dotto nell’ambito dell’insegnamento della disciplina, ora non diverso dai modelli di ‘trattato’ 
terapeutico” [ANDORLINI 1993, 469-70 n. 22]. 


4.2 Introduction and commentary 


The possibility to add a “front matter” and a “line-by-line commentary” is currently 
allowed by both Papyri.info and the DCLP, though it has been poorly exploited so far. 
Some documentary samples have been produced in the framework of the born-digital 
editions described by L. Berkes in this volume; for the DCLP side, see R. Ast and H. 
Essler ibidem. The DCGMP project utilizes systematically this feature to provide a gen- 
eral introduction to each text and to record the main textual features that cannot be 
encoded within the text for the moment, namely technical descriptions? (see below 
for future integration with the Medicalia Online lexical tool) and parallel passages in 
both other medical papyri and literature (see below for the intertextual layer). 

An earlier way of inserting short comment strings (mostly providing information 
about re-editions of the texts) within the inline markup, through the <note> tag (Lei- 
den+: /* */), is possible but definitely not exploited nor really recommended (the 
Leiden+ guidelines warn: “use sparingly”!) 


4.3 Translation 


Translations of the original text in multiple modern languages are currently sup- 
ported in the existing databases. The DCGMP policy is to produce at least an English 
translation of each text, but when a scholarly translation in a different language does 
exist, the preference is granted to that one. As long as translation is a means of inter- 
pretation, the possibility to align the original text with its translation(s)”° is worth be- 
ing explored, for instance through the Medicalia Online lexical platform (see below). 


4.4 Materiality 


The physical appearance of the papyrus is of the utmost importance for the papyrol- 
ogists, who are deeply interested in the material aspect of the fragments.” Size and 
colour are the first physical features that are indicated in a traditional edition, and 


68 BERTONAZZI 2018a, 48-51 (see also pp. 51 ff.). 

69 In compliance with one of the original goals of the Corpus dei Papiri Greci di Medicina, i.e. the 
historical-scientific perspective described by ANDORLINI 1997a, 23. 

70 On translation alignment cf. e.g. VERONIS 2000. 

71 See the remarks by R. Ast and H. Essler in this volume. 
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since they are not recorded in the metadata catalogues, they should be indicated in 
the introductory matter (see above) or in new metadata fields. A digital picture could 
compensate for this, but itis not available for all papyri (see below). 

Material features of the writing support are encoded directly in the text itself ac- 
cording to the current standards. As to this point, there are some notabilia that must 
be stressed because they slightly differ from the traditional editorial practice. Line 
numbers, for example, are to be indicated for each line (contrarily to what happens 
in most of the printed editions) in a standardized way (number-dot-space); words that 
wrap between two lines are indicated with a hyphen after the dot of the second line 
number, not at the end of the first line. Both Leiden+ procedures may seem rather 
unconventional to ‘traditional’ papyrologists, but they are grounded on XML require- 
ments: numbers are related to “line break” tags (<1b/>) that must open each new 
line of the encoded text; hyphens represent the attribute break="no" in the same 
<1b/> tag, meaning that the new line does not break the word.” In the HTML output 
things are brought back to the traditional display (line numbers grouped by five, hy- 
phens at the word break). 

Writing sides (recto/verso, folios in codices) and multiple fragments are encoded 
as document divisions (XML «div type="textpart">). This tag deploys an n at- 
tribute, which expresses the number/letter identifying the fragment/folio (or the let- 
ters r/v for recto/verso), and a subt ype attribute, defining the type of part: " frag- 
ment", "folio", but also "column" or "part" if the text is divided into different 
layouts or sections even within the same writing side. By the way, this is a good way 
of dealing with texts that are composed by several sub-texts, like e.g. collections of 
letters or recipes.” Divs can be nested if needed, and each text block is anyway en- 
closed by an <ab> tag (“anonymous block”). In Leiden+, Divs are introduced by the 
tag <D= followed by the said attributes preceded by dot (e.g. <D=.r for recto, 
<D=.1.fragment for fragment 1), the text block by <=. Every text tag must be closed 
at the bottom, paying attention to the correct order (divs are opened before <ab> at 
the beginning, and symmetrically closed at the end). 

The most remarkable physical feature of the papyri is fragmentation. This usually 
results in marginal breaks (printed as rows of dashes at the top and/or at the bottom, 
closed square bracket on the left, open square brackets on the right) and in-text gaps 
(represented as square brackets surrounding some indication of the missing text, 
which may or may not be supplemented). They are currently encoded as in-text 
markup; however, the digital concept of gap, according to the TEI/EpiDoc canons, is 
slightly different from the traditional one, and it deserves some comments. Each un- 
supplied break or lacuna is indeed treated as missing text, and all types of missing 


72 See the contribution by G. Celano in this volume for the problems given by non-breaking lines in 
the digital papyrus texts. 
73 Anattempt of this can be found at http://litpap.info/dclp/60175 (P.Oxy. IX 1184, Hippocratic letters). 


The Corpus of the Greek Medical Papyri and a New Concept of Digital Critical Edition — 19 


text are handled with the <gap> XML tag, which is used to encode both lost and il- 
legible portions of text (the latter being usually printed — and are displayed - as series 
of dots). The «gap» attribute reason distinguishes the two cases ("lost" or "il- 
legible", plus "ellipsis" ifthe text is missing because left untranscribed by the 
editor), while the attribute unit specifies if we are dealing with just a number of 
characters or with entire lines. The extension of the missing text is defined by the 
attributes extent ("unknown" number of chartacters or lines), quantity (known 
number of characters or lines), atLeast / atMost (approximate range calculation). 
A precision attribute set to "low" indicates the uncertainty of an extension. Lei- 
den+ syntax developed around the use of dot, after the print conventions of indicat- 
ing illegible characters by means of dots: a dot followed by a number or a range (and 
by the indication 1in when dealing with lines) indicates illegible text; the same, but 
preceded by the indication lost, marks lost text. If the dot is preceded by vestig, 
an element <desc>vestiges</desc>is added to the «gap reason="illegi- 
ble"> tag, in order to encode generic ‘traces’ (which is indeed the HTML output). 
Untranscribed text is marked differently (see below), as are supplied gaps, though 
from the papyrological viewpoint they are actually the same facts as the unsupplied 
ones (see below). 

Unclear characters are another good example of how semantic markup differs 
from traditional print editions. In the latter, any unclear letter is marked with an un- 
derdot, either with the letter on its top (if legible) or not (if illegible). In the digital 
edition, illegible characters are non-textual portions marked with the <gap rea- 
son="illegible"> tag described above, while unclear but legible characters are 
text portions marked with an <unclear> tag. Leiden+ utilizes the regular Unicode 
underdot in the latter case, while in the former the dot is recalled with a full stop fol- 
lowed by the number or range of unclear characters. In the HTML display, they be- 
come both underdots. 

The close relationship between the text and its support is the core focus of the 
CRMtex project (http://www.cidoc-crm.org/crmtex), which provides tools for manag- 
ing the study and publication of ancient handwritten documents and may be taken into 
consideration for developing new strategies in the digital edition of papyrus texts too. 


4.5 Palaeography 


Annotating palaeography is a huge task. Beside a general palaeographical descrip- 
tion of the handwriting, which may well be detailed in the front matter, the possibility 
to mark up each single character is particularly tricky. Apart from its extreme intri- 
cacy, such a task should be preceded by a huge effort to standardize palaeographical 
terms and descriptions, which are notoriously idiosyncratic and inconsistent. Text 
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alignment with the digital picture can help: this is precisely the purpose of the Ana- 
gnosis project, conducted at Wiirzburg by Holger Essler, which may eventually come 
to the automatic recognition of the characters.“ 

Some visual characteristics of the written text are encoded with an appropriate 
markup that describes the appearance of lines, words or single characters with spe- 
cial display features. Lines that are written perpendicular or inverse with respect to 
the main body of the text can be encoded with a rend="perpendicular"/"in- 
verse" attribute of the <1b/> tag (in Leiden+, this is obtained by putting the line 
number in brackets, typing a comma+space instead of dot+space, and then the ap- 
propriate attribute value). Similarly, ancient text highlights (taller characters, super- 
script, subscript, supraline, underline) are tagged with a <hi> element, with a rend 
attribute specifying the kind of highlighting (standard values: "tall", "super- 
script", "subscript", "supraline", "supraline-underline"). Leiden+ 
equivalents are shaped in a graphical appearance that hints to the text display on the 
papyrus (respectively: ~| |x| |~tall; 1^x^l; VIx|/; x ; =x=). It must be noted 
that currently the use of this markup is deprecated when the highlighting describes an 
abbreviation (see below). A text written inside a box is encoded with a milestone ele- 
ment (<milestone rend="box" unit="undefined"/>; Leiden+: ###).” 

Another palaeographical feature that can be encoded with the current markup is 
the handshift (<handShift new="m2"/>; Leiden+: $m2; displayed as (hand 2)); 
for the use of this tag see also Marja Vierros’ chapter in the present volume (which — by 
the way - contains also an interesting discussion about palaeographical metadata). 


4.6 Text 


At the core of the papyrus fragment, text as a linguistic fact deserves the highest and 
deepest attention, for both the peculiarities of the language of the papyri in general 
and the specific relevance of technical language in small corpora like the medical 
writings.” Digital annotation is a fundamental practice in the linguistic study of a 
corpus of texts:” it allows to describe, record, interpret and analyse linguistic infor- 
mation at several levels, in which each layer corresponds to a particular category of 


74 See below and the Anagnosis section of the chapter by R. Ast and H. Essler in this volume; cf. 
REGGIANI 2017, 151 ff. 

75 Cf. CORAZZA 2018a; see below for ‘milestones’. 

76 The lexical and linguistic study has always been a primary purpose of the Corpus dei Papiri Greci 
di Medicina: cf. ANDORLINI 1997a, 24. 

77 On the definition of linguistic corpus cf. SINCLAIR 1996; in general on corpus linguistics cf. LÜ- 
DELING — KYT6 2008-09; LUDELING 2011; and see M. Vierros and I. Bonati in this volume. On the theo- 
retical and practical correctness of treating Greek medical papyri as a proper textual corpus I think 
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relevant information.“ Multiple levels of linguistic annotation of papyrological rele- 
vance can be outlined.” 


4.6.1 Part-of-speech annotation 


The basic annotation layer, related to the analysis of the parts of speech — also known 
as treebank because it is usually represented with a tree graph —, would allow to con- 
duct an extensive lexical, phraseological-formulaic and syntactic analysis on the cor- 
pus, aimed also (but not only) at discovering styles and writing strategies specific of 
the medical texts, both literary and documentary: think only of the possibility to in- 
vestigate formulaic uses and writing skills,*° to find out influences or interpolations 
between authors, or the presence of literary echoes in technical or documentary 
texts.?' The entire technical textual strategy deployed by medical authors? could be 
studied in this way. Analysing in depth and comprehending the syntactic structure of 
texts would allow also to solve problems of interpretation and attribution,? or even 


there must be no doubt. A linguistic corpus is usually intended as a selection of sample texts repre- 
sentative enough of a language, and though the medical papyri at our disposal come from a random 
and incomplete selection, they can be considered as the entire reference population rather than as a 
sample of a larger group, so that linguistic annotation seems to me absolutely feasible. 

78 Cf. REGGIANI 2017, 178 ff., and M. Vierros in this volume. 

79 Cf. REGGIANI 2015 and 2016a; BERTONAZZI 2018b. 

80 Cf. MARAVELA — REGGIANI 2018. ROUED-CUNLIFFE 2014 described how digital encoding can prove 
useful for the analysis of grammar patterns of an ancient textual corpus (the Vindolanda tablets). A 
seminal project on annotating a corpus of private letters on papyrus, conducted by S. Porter and M. 
O'Donnell, has produced a number of valuable observations about modes and tenors of discourse, 
structures of information, semantic patterns, and so on (PORTER — O'DONNELL 2010). 

81 "[L]a possibilità di identificare alcuni papiri con trattazioni di un autore tramandato solo indiret- 
tamente inserisce tasselli nuovi nella complessa stratificazione della trasmissione indiretta, soprat- 
tutto quando sono i papiri i soli testimoni diretti di autori tramandatici per excerpta e citazioni (Apol- 
lonius Mys, Heras, Heliodorus, Herodotus Medicus)" (ANDORLINI 1997a, 22). 

82 ANDORLINI 2006 pinpointed the existence of an expressive strategy of medical technical texts: 
“L’osservazione di tali fenomeni, e del loro riproporsi costantemente nella tradizione dei testi medici 
greci su papiro, permette di riconoscere diverse fasi e livelli in cui il sapere tecnico contenuto nella 
ricetta medica veniva materialmente veicolato al lettore/consumatore attraverso moduli espresssivi 
e dispositivi tecnici, visivi, fisici, che formano una sorta di koiné, un tutt'uno tra lingua tecnica e 
scrittura speciale dei testi. Di qui la suggestione di rintracciare una specie di ‘gergo’ nei connotati di 
quel particolare linguaggio criptico, grafico ed espressivo, che comunica all'interno di una determi- 
nata categoria professionale: il medico, gli altri medici (i colleghi), il farmacista, il commerciante di 
farmaci, il paziente. Si tratta di modi speciali di usare parole e segni attraverso i quali le competenze 
medico-terapeutiche tendono a specializzarsi all'interno di una corporazione di addetti alla profes- 
sione medica" (p. 153). 

83 "l'analisi sintattica attraverso l'annotazione in un cosiddetto ‘treebank’ potrebbe mostrare più 
chiaramente la struttura del testo e facilitare il confronto tra il testo veicolato dal papiro e la tradizione 
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only to understand the exact meaning of a text (let us consider for instance the case 
of schematic prescriptions — e.g. P.Oxy. VIII 1088, http://litpap.info/dclp/63118 — 
where implicit verbs and asyndetic syntax would have to be made explicit). 

In the field of classical philology such linguistic analyses are now at a very ad- 
vanced level, but papyrology too has made important progress, with the project Se- 
matia, aimed at facilitating the linguistic tagging of the documentary papyri encoded 
in Papyri.info and described by Marja Vierros in this volume. Another possible way to 
linguistic annotation of the papyri is explored by Giuseppe Celano in the present book 
as well. The literary side has been unfolded by the Grammatically Annotated Philode- 
mus project, conducted by Daniel Riafio Rufilanchas and Holger Essler (Würzburg) 
and aimed at deeply annotating the Greek philosophical papyri from Herculaneum 
on morphological, grammatical, semantic, stylistic layers.^ Fragmentation is of 
course an issue when one decides to perform linguistic analysis: phrases, sentences, 
words are broken and it is not rarely difficult to understand the syntax, not to say to 
tokenize the words.” These are problems that digital tools must unavoidably face, 
and which an infrastructure based on multiple interconnected layers may feasibly 
overcome. 


manoscritta, soprattutto nel caso di papiri per cui si sospetti una possibile paternità” (BERTONAZZI 
2018a, 74). The case of surgical author Heliodorus is paradigmatic: “l’analisi del lessico tecnico dei 
papiri chirurgici ha portato a individuare paralleli testuali tra testo tramandato su papiro e tradizione 
manoscritta, talvolta significativamente stringenti come nel caso di P.Strasb. inv. 1187 e diversi passi 
di Eliodoro ap. Oribasio. Alcuni altri papiri (P.Lond.Lit. 166, P.Gen. inv. 111, P.Fuad.Univ. 1, P.Ryl. 
3.529), come già notato dagli studiosi, sono caratterizzati da una forte presenza di ‘lessico eliodoreo’ 
e da alcune peculiarità proprie del modus operandi del chirurgo, come la predilezione di interventi 
chirurgici che siano il più sicuri possibili per il paziente, nonché del modus scribendi, come il ricorso 
frequente alla prima persona - singolare o plurale —, la definizione con esattezza delle posizioni ‘to- 
pografiche’ della parte operata (dentro, fuori, sopra, sotto), e una sostanziale semplicità delle strut- 
ture sintattiche usate. Ad oggi, i tentativi di attribuire i papiri citati alla paternità di Eliodoro si sono 
basati quasi esclusivamente su criteri lessicali nel confronto tra il testo tramandato su papiro e sui 
capitoli di Oribasio che portano la titolatura ‘da Eliodoro’. Una nuova possibile strada offerta dalle 
nuove tecnologie della papirologia digitale è quella costituita dall’annotazione sintattica dei testi: 
un’analisi più accurata non solo del lessico, che come è noto è la parte più ‘volatile’ della lingua, ma 
delle strutture morfologiche e sintattiche dei passi del compilatore tardo in sinossi con i testi dei pa- 
piri, sia pure nella limitatezza delle pericopi testuali preservate, potrebbe gettare nuova luce anche 
su questo aspetto tra i più incerti quanto stimolanti della ricerca” (BERTONAZZI 2018a, 242-3). Marja 
Vierros has recently presented at the workshop “Act of the Scribe: Interfaces Between Scribal Work 
and Language Use” (Athens, April 6-8, 2017) some preliminary remarks on Applying Modern Author- 
ship Attribution Methods to Papyri and Ostraca (abstract at http://blogs.helsinki.fi/actofscribe/work- 
shop): cf. REGGIANI 2017, 185. 

84 Cf. REGGIANI 2017, 181; R. Ast and H. Essler in this volume. 

85 Cf. RIAÑO RUFILANCHAS 2014, 160—1; ESSLER — RIAÑO RUFILANCHAS 2016, 498; and the observations 
by R. Ast and H. Essler, M. Vierros, and G. Celano in the present volume. 
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Fig. 3: Sample treebanking of GMP II 10, medical letter (from REGGIANI 2015). 


4.6.2 Lemmatization 


An annotation layer of lemmatization, that is the reduction of a declined or conju- 
gated word to its original lemma, would prove essential in defining and analysing a 
specialised technical vocabulary like the one employed in the medical papyri, which 
has always been a relevant research focus of Isabella Andorlini’s concept of the med- 
ical corpus.* Such a sort of layer would represent an important bridge to connect the 
textual database to the related project Medicalia Online, consisting in an extensive 
lexical reference platform for ancient medical technical terms, as described by Isa- 
bella Bonati in this volume.” Systematic links to the lexical records (and the other way 
around) could contribute to create a dynamic lexicon® of medical technical terms in the 
Greek papyri. In addition, as Joanne Stolk observes in this volume, the possible deploy- 
ment of a lemmatization layer would help encoding linguistic variation more properly, 
while encoding lexical information would be helpful for the creation of word reference 
indices. 


86 Cf. ANDORLINI 1997a, 24. For more recent works on this topic see BONATI 2016a, 2017, 2018a, 2018c, 
and BERTONAZZI 2018a. For a parallel exploitation of digital encoding for the development of vocabu- 
lary analysis, cf. ROUED-CUNLIFFE 2014 apropos of the Vindolanda corpus. 

87 Cf. also BONATI 2018b and 2018d. On the connection between digital editions and Medicalia Online 
cf. also BERTONAZZI 2018a, 43-8 and 73-4, and 2018b. 

88 On “the interdependence of lexica and new editions” cf. ESSLER — RIANO RUFILANCHAS 2016, 492. 
ROUED-CUNLIFFE 2014 speaks of “integrated indexing”. 
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4.6.3 Abbreviations 


Abbreviations are another striking point. Medical writings (prescriptions above all, 
but not only) make a particularly extensive use of abbreviated words, developing a 
proper “graphical-expressive jargon";? given their technical nature, it would be ex- 
tremely useful to investigate their use, e.g. whether there is any underlying pattern. 
As to now, abbreviations are to be encoded in the same way as the documentary pa- 
pyri, that is according to the type of expansion — resolved or unresolved, distin- 
guished on the ground of the XML syntax. Resolved abbreviations are encoded as “ex- 
pansions”, with the <expan> tag enclosing the text spelled out and the <ex> tag 
enclosing the text abbreviated (Leiden+: double set of brackets, one enclosing the 
whole word and the other one enclosing the expanded abbreviation); unresolved ab- 
breviations are encoded as “abbreviations”, enclosed by the <abbr> tag (Leiden+: 
(|x|)). Any attempt to encode the type of abbreviation (e.g. by raised letter or by 
overline) is currently deprecated.? I strongly hope that in the future this level of an- 
notation may be taken into consideration, since abbreviating strategies are relevant 
for the correct transcription and interpretation of texts, as in P.Strasb. inv. 1187 
(http://litpap.info/dclp/59968), which 


exhibits two cases of allegedly abbreviated words that have been object of interpretative discus- 
sion. At ll. 11 and 14 two v overlined with a horizontal stroke (belonging to a plural genitive and 
a nominative respectively: -o are clearly legible; these strokes are abbreviation marks accord- 
ing to FAUSTI 1989, 158, contra MARGANNE 1998, 68, following ed.pr. for the latter, which supplies 
the v as omitted by the scribe, in angle brackets. The presence of the overline strongly suggests 
that we are indeed dealing with abbreviated words: therefore, though relying by rule on the more 
recent edition, [for the digital edition] it has been chosen to follow the editio altera, marking the 
abbreviations according to the current Leiden+ conventions, though preserving the reading of 
the editio tertia in an |ed| tag.” 


In the described case, a correct understanding of the abbreviation mark proves essen- 
tial in the text editing and encoding. Moreover, special ways of expressing combina- 
tions of characters or even entire words cannot be encoded but in the standard, sim- 
plified way: for example, to limit ourselves to the cases of P.Ant. II 127 
(http://litpap.info/dclp/65340) mentioned by CORAZZA 2018b, the sinusoid for ot and 
the peculiar sign -//- for gioi, which must be encoded as whichever symbol «ex- 
pan><ex>elol</ex></expan> (Leiden+: ( (£101))), losing interesting pieces 
of information. 


89 Cf. e.g. the case of the Antinoupolis papyri described in CORAZZA 2018b. 

90 Cf. ANDORLINI 2006. 

91 Cf. REGGIANI 2018b, and see L. Berkes in this volume. As he notes, text-image alignment could be 
a good compromise: but searching for the different abbreviation types would not be possible as well. 
92 BERTONAZZI 2018b; cf. BERTONAZZI 2018a, 67. 
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A tentative proposal, based on a preliminary survey conducted on the different 


abbreviation typologies in the Greek medical papyri and on the TEI/EpiDoc XML 
guidelines, envisions the following possible instances (apart from ‘traditional’, ‘sim- 
ple’ abbreviations):? 


Supralinear abbreviations (e.g. yaàkáv?(ov): PSI X 1180a iii,12). The superscripted 
letter may be tagged as any normal superscripted letter («hi 
rend="superscript"> tag); such a combination is already possible and 
correct in the current Leiden+ syntax, yet deprecated by the official guidelines. 
XML: «expan»xoAxéáv«hi rend="superscript">0</hi><ex>ou</ex> 


</expan>; L+: (xoAkóv|^9^| (ov) ). 

Abbreviations by stroke (horizontal: e.g. t0 (v), P.Mich. XVII 758 H verso,2; 
vertical: e.g. &|(npdv), P.Mich. XVII 758 H verso,3; slanting: e.g. yadBav/(n), 
P.Mich. 758 H,11; sinusoid: e.g. yiyvetj(at), P.Ant. III 127, i b, 6). The strokes may 
be encoded through the EpiDoc «am» tag (“abbreviation mark”)% and further 
defined as non-alphabetic glyphs (see below) as follows: «expan» «abbr» tõ 
«am» <g type="horizontal-stroke"/> </am> </abbr> <ex>v 
</ex></expan> = (t6 *horizontal-stroke* (v)) ;<expan><abbr> 
E<am><g type="vertical-stroke"/></am></abbr><ex>npdv</ex> 


</expan> = (€ *vertical-stroke* (npdv)) ; «expan» «abbr» 
XoABéáv «am» <g type="slanting-stroke"/> </am> </abbr> <ex>n 
</ex></expan> = (xoABÓóv*slanting-stroke* (n)) ; <expan> 
<abbr>ytyvet<am><g type="sinusoid"/></am></abbr><ex>al</ex> 
</expan> = (yiyvet *sinusoid* (at) ). Note that such combinations are 
correct in the current Leiden+ syntax, but the strokes need to be rendered 
properly in the HTML output; moreover, the <am> tag is not supported by the 
platform. 

Discontinuous abbreviations (e.g. p(z)1(à): MPER n.s. XIII 9, 1). This type of ab- 
breviation is already normally working in the SoSOL environment. «ex- 
pan»yu«ex»e«/ex»i«ex»ó«/ex»«/expan» = (u(e)tT(d)). 

Abbreviations by monogram (e.g. oyi(or00); mp(dc); xp(à)).? This type exploits 
the way in which monograms are marked up in EpiDoc and Leiden+,” i.e. a ‘g- 
type' with indication of the letters that are interwoven to form the monogram. 
<expan> «abbr» «am» <g type-"monogram"»oxi«/g» «/am» </abbr> 


93 In general, on abbreviations in papyri see e.g. CLARYSSE 1990, DEGNI 1999, and GONIS 2009; with 
special regards to documentary texts, BELL 1953 and BLANCHARD 1974; for literary papyri, MCNAMEE 
1981 and 1985. A typological work on the abbreviations in medical papyri has been preliminarily con- 
ducted by L. Iori and M. Centenari in the framework of the Corpus of the Greek Medical Papyri Online 
project (cf. http://www.papirologia.unipr.it/eventi/GDS/2010/centenari-iori.html). 

94 Cf. http://www.stoa.org/epidoc/gl/latest/trans-abbrevmark.html. 

95 On the relevance of the monogram yp(à) see ANDORLINI 2018. 

96 Cf. http://papyri.info/docs/leiden_plus s.v. “Non-alphabetical character with symbol”. 
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<ex>ox 1oTOÙ</ex></expan>= ((*monogram, oxl*oxtotow)) ;«expan» 
<abbr> <am> <g type="monogram"> up </g> </am> </abbr> <ex> 


noóc </ex> </expan> = ((*monogram,np*npéc)) ;<expan> «abbr» 
«am» <g type="monogram"> xp </g> </am> «/abbr» «ex» yxpóà 
</ex> </expan> = ((*monogram,yp*xp@)). This may apply to some 
symbols for units of measure too, e.g. Ai(tpa), o(v)y(xia), etc., which may make 
easier a systematic study of quantities and dosages in the ingredient use. Number 
digits and values can be easily encoded in the current way (XML: <num 
value="16">L1c</num>; Leiden+: <#1c=16#>). 


4.6.4 Linguistic variation 


The topic of linguistic variation is the most intriguing and difficult to handle. As noted 
above, traditional critical editions tend to overcome any fluctuation in favour of a re- 
constructed text, while fluctuations are actually fundamental for the phenomenology 
of the written text. Linguistic variation in the papyri has already been extensively in- 
vestigated by Joanne Stolk, who resumes her thoughts from the digital perspective in 
this same volume. I would like just to focus on some relevant points, to introduce the 
problem of linguistic variation in the medical papyri. The current markup of what I 
call textual fluctuations — handled by the <choice> tag, “indicating that [the read- 
ings] are two editorial versions of the same span of text, and should be read as alter- 
natives, not shown side by side”? — distinguishes between “corrections” of outright, 
well recognizable scribal mistakes (<corr> tag marking the correction, <sic> tag 
marking the original reading; Leiden+: < : correction | corr original: >) and “regular- 
izations” of phonetic misspellings (<reg> tag marking the regularization, <orig> 
tag marking the original reading; Leiden+: «:regularization | reg | original: »).?? 
Though the treatment of ‘regularizations’ has been improved during the history of the 


97 http://www.stoa.org/epidoc/gl/latest/trans-regularization.html: “It is most common to mark a 
regularization of this kind at the level of the whole word, rather than of individual characters affected 
[...]. This will make it easier to generate an apparatus reading for the regularized form (or the original 
form, depending on which you want to privilege), but it may also be impossible to identify individual 
affected characters in a dialect spelling or grammatical form. On the other hand, tagging the individ- 
ual characters might make it easier to index or search for specific features, such as the iotacism of t 
and et”. See J. Stolk's observation (in this volume) that apparatus regularizations/corrections work as 
textual equivalents and not ‘better’ substitutes of the original text. 

98 For some case studies cf. BERTONAZZI 2018a, 63-4. 
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papyrological databases, by moving the display of the original spelling from the ap- 
paratus to the main text and vice versa,” showing a stronger care for the phenome- 
nology of the papyrus text, and though the meaning of the <choice> tag points to 
alternative encodings of the same text portion, the fact is that we are still dealing with 
a differentiation between a form that is considered as ‘standard’ or ‘regular’ and a 
form that deviates from it. Phonetic fluctuations like opwpvn / Suvpvn in the medical 
papyri"? (but see also some relevant cases in the documentary papyri, like yorvpa / 
xv0pa'?! and &(oontpov / dountpov'™) show that not always is it easy to define what 
is the ‘conventional’ spelling and what is the ‘deviation’, so that a layer capable to 
align the ‘variants’ to each other, word by word, rather than categorizing them in a 
sort of hierarchy, may be much welcome.” 

Diachronic and synchronic fluctuations — depending on the evolutions and trans- 
formations of Hellenistic Greek language and on the rise of personal or geographical 
substandards™ — do occur in the medical papyri, but their existence not rarely points 


99 Inthe earlier Duke Databank of Documentary Papyri, “the conventional koine form is given first, 
followed by numbered braces enclosing the scribe's form or the edition’s misprint: e.g., óvopa 
{4wvopa}4 shows that the scribe has misspelled 6vopa, bnép {5umap}5 that he wrote epsilon over al- 
pha, aùtod {6avtw}6 that he miswrote dative for genitive, A00p {7A@0g}7 that the edition has a mis- 
print for A@ùp” (WILLIS 1984, 169-70); cf. REGGIANI 2017, 216. This slight prominence given to the 
‘standard’ form was retained in the first stages of Papyri.info, where it was included in the main digital 
text, whereas the original form as written by the scribe — marked with | regl, | corr| or the former 
|orth| tag (cf. REGGIANI 2017, 236 n. 119) — was displayed in the apparatus. As of September 2011, the 
two elements in the | reg | tag have been swapped with each other (cf. http://digitalpapyrology.blog- 
spot.it/2011/09/just-posted-to-papylist-dear-colleagues.html). This required a huge effort, because 
the ancient reading was originally transcribed diplomatically without spirits and accents, but its in- 
clusion in the text made it necessary to add them (cf. REGGIANI 2017, 224). 
100 “Nei papiri è scritto quasi regolarmente $u-” (ANDORLINI 1981, 61 n. 54), which conversely should 
be a ‘deviating’ spelling of ‘regular’ ouúpvn (cf. GIGNAC 1976, 121-2). 
101 Cf. BONATI 2015. 
102 On this peculiar double fluctuation cf. BONATI — REGGIANI 2018. 
103 Cf. e.g. BOSCHETTI 2007 apropos of philological variant alignment; further discussion in REGGIANI 
2018a. The current platform also allows for handling language shifts, i.e. the markup of a language or 
script different than the main document’s default. This is rendered with the tag <foreign> and the 
xml:lang attribute, the value of which may be grc for Greek words in a Latin text, 1a for Latin 
words in a Greek text, grc-Latn for Greek words in Latin characters, 1a-Grek for Latin words in 
Greek characters, and so on. In Leiden+ it is marked ~ | x|~grc and the like. If characters or lines in 
a different language or script are omitted by the editor, this is indicated with the «gap rea- 
son="ellipsis"> tag (see above) including a «desc» tag filled with the appropriate language 
(e.g. Coptic, Demotic; Leiden: (Lang: Coptic 1 line) etc.). Itis also possible to mark up cross- 
language equivalencies, for example giving the Greek correspondent of a Coptic term. For this task, 
the current system exploits the ‘regularization’ tag by adding an xm1: lang attribute, e.g. «choice» 
«reg xml:lang="grc"> óàpoxoc «/reg» «orig» apak </orig> </choice> (Leiden+: 
«:U0poxoc-grc|reg |Apak: >). The explanatory note goes into the apparatus accordingly. 
104 Cf. REGGIANI 2018a and 2018c. 
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to deeper levels of textuality, with reference to ancient literacy and intertextual rela- 
tions (see below), and therefore deserve a very peculiar attention. For example, in 
P.Aberd. 124, i= GMPI1, i (II cent. AD, http://litpap.info/dclp/63334), a papyrus frag- 
ment preserving chapter 37 of Hippocrates’ treatise De fracturis, at 1. 14, where all the 
codices (and the editions) have the ‘regular’ Ionic form mryeos, the papyrus shows 
clearly r]nyxews, the Koine form, which looks like an ‘interference’ of a typical ‘lin- 
guistic variation’ pertaining to the language of the documentary papyri, where it 
would be the standard form.'^^ Perhaps even more significant are the following cases. 
P.Oslo inv. 1576, a fragment of a catechism dealing with tumour-like diseases,’ 
partly overlaps with the text of P.Oxy. LXXX 5239 (both II-III cent. AD). The latter is 
more likely a ‘treatise’ than a questionnaire, as its editor David Leith notes (see below 
for this distinction), and the difference may be perceived from the lack of eistheseis in 
its questions. The scarceness of the surviving portions of text makes it hard to say 
whether the questionnaire derives from the treatise or they are two different outcomes 
of a same ascendant (see below for intertextual relations). As far as the extant parallel 
text is concerned, the wordings diverge from each other only for one variant: 
b6poxrjÀn (P.Oslo, 1. 5) vs [d]ypoxnAn (P.Oxy., 1. 15). The latter is usually considered 
as a minority variant (LSJ, quoting Poll. IV 203) of the former, used e.g. by Ps.Gal. 
Def.med. 424 = XIX 447,12-13 K., but it is in fact attested three times among the med- 
ical writers.!” Are we facing a trivialization in the Oslo papyrus, or a simple phonetic 
variant in the Oxyrhynchus papyrus, or just two different traditions bearing the same 
degree of ‘correctness’, attesting to a fluid notion of technical language? Moreover, in 
the following line of the Oslo papyrus (not paralleled by its Oxyrhynchus counterpart 
any more) we read ëpurp[ozgtëñ, which looks like a phonetic variant of £Avtpoei8r|c 
“lid-like”, *cover-like" (attribute of one of the membranes enveloping the scrotum). 
Rho for lambda is indeed a very frequent phonetic exchange in the language of the 
Greek papyri,’ but the same variation is to be found among the manuscripts preserv- 
ing Ps.Galen's Introductio seu Medicus, containing a descriptive passage (XIV 719,5- 
10 K.) of the same anatomical part.” A similar case is offered by P.Coll.Youtie I 4v 


105A comparable case is roig (and the following forms supplied accordingly) in P.Fay. 204,9 
(http://litpap.info/dclp/60181) vs Ionic voiot of the rest of the tradition of Hippocratic aphorisms. 
106 MARAVELA - LEITH 2007. The papyrus will be republished in the forthcoming third volume of the 
Papyri Osloenses. I am most grateful to Anastasia Maravela for sharing her drafts of the new edition 
and for discussing with me some textual and linguistic details. 

107 Orib. Syn.Eust. III 28, 6 and 9 = CMG VI 3, p. 75, 15-16 and 21 Raeder; Steph. In Hp. Progn. II 1 = 
CMG XI 1,2, p. 140,25 Duffy. The case resembles — mutatis mutandis — that of dountpov, the above- 
mentioned ‘deviant’ form of Ë(Uoortrpov “mirror”, for which BONATI 2016a, 246-8 already proposed 
the rank of substandard. 

108 Cf. GIGNAC 1976, 105. 

109 The previous editors corrected it in £puOpoei8o0c, but the newest Belles Lettres edition (PETIT 
2009) prints £Av1poet600G (XII 11, p. 40,1; see PETIT 2009, xcvi-xcix for the description of the manu- 
script tradition). Quite interestingly, the author of the treatise came possibly from Egypt (cf. PETIT 
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(http://www.litpap.info/dclp/64118), a collection of prescriptions dated around the 
III cent. AD, in which pdodg at 1. 8 (“reed bed”) could be a spelling variant of pÀotóç 
(“bark”) just as pAowv in Dsc. III 147 has a variant reading qAo0v.'? In a completely 
different type of text, P.Oxy. XIX 2221r + P.Kóln V 206r (http://litpap.info/dclp/61916), 
a I-century AD commentary to Nicander’s Theriaka attesting interesting textual vari- 
ants (see below), at 1. ii,29 we read Bopeitat vs Boteitat codd. (Nic. Ther. 394), which 
looks like the genuine form; the reading of the papyrus is a phonetic variant of the 
popeta to be found in the ancient scholia to that passage."' Once more time, the 
impression is that we are facing a peculiar intersection of multiple literacies, emerg- 
ing at the phonetic level but implying deeper meanings that cannot be flattened ina 
traditional apparatus. 

Even seemingly outright syntactic ‘mistakes’, in such a technical corpus as the 
medical papyri, can conceal deeper levels of meaning: an established prescriptive 
formula like b&wp xp@ “use with water” goes far beyond the apparent anacoluthon 
(£v 06a would be expected), becoming a distinctive mark of medical recipes, and 
must be treated accordingly.” 


A very tentative example of annotation of a medical papyrus: GMP H 10. 
fi = Tokenization, PoS = Part of Speech, H = Syntactic head, R = Syntactic relation [H + R + ID = 
fc al on]; N = Non n; E = Kinds of errors [VE = vowel exchange; BD = broken diminutive; ML 
= missing letter, EC = error corrected; CE = consonant exchange]; L = Lemmatization, TT = Technical terms; A = 
Abbreviations; M = (Diajcritical marks and punctuation [The other levels of annotation do not apply to this instance 
Note that PoS and syntactical taggings are simplified for reason of space] I am very grateful to Dr. Giuseppe G.A 
Celano for his essential help in the grammaticallinguistic annotation 


————— h — —— 
1 2 3 T 6 8 9 
T 
zėuyóv | por | t pov | xoi | tò xó | plov] 
pron | conj | art noun | pron | con 
da $ 
OB!CO | ATR 


ID 
[T 
PoS 
H 
R 
N | 
È 
L 
Ir 
A 
[M 
[m 


Y 
S [= 
> 


Evà.ucoy 
VE 


» irm [aom 


! 


slanting stroke slant stroke 


z 


Fig. 4: a former attempt to outline some annotation layers for Greek medical papyri (REGGIANI 2015). 


2009, 1—li), which suggests that the phonetic variation could have worked both ways. I discuss this 
and the preceding case in REGGIANI 2018a and 2018e. 

110 Cf. T.T. RENNER, ad loc. 

111 The same phonetic exchange p/q occurs elsewhere in the papyrus: see dugiogatva for 
aupioBatva in ii,9, 14, 15. 

112 Cf. ANDORLINI 2006, 163, and 2018; REGGIANI 2018a. 
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4.6.5 Transtextuality 


As we saw, quite often linguistic phenomena may be clues to broader cultural facts. 
The complexity of textual phenomena in the Greek medical papyri (but not only!) can 
be effectively described through the concept of transtextuality as investigated by Gér- 
ard Genette since the Eighties. Transtextuality defines all the various possible rela- 
tionships among texts (“all that sets the text in relationship, whether obvious or con- 
cealed, with other texts”) and encompasses several subcategories,! on which I will 
base the description of the next layers. 


4.7 Paratext 


Paratextuality is defined as the relation between one text and what surrounds the 
main body of the text: in Genette’s theory, paratext is mainly composed of titles and 
headings, but we may add any other graphical device that comes along the text itself, 
including punctuation, which is not a common feature in papyrus texts and therefore 
deserves special treatment.” 

In a writing system based on scriptio continua (i.e. not separating letters into 
words), punctuation is a way of facilitating the reading by separating words or groups 
of words. Single, double, triple dots occur irregularly with this function; in TEI/EpiDoc 
they are encoded as “non-alphabetical glyphs” with the tag <g> and the attribute t ype 
defining their nature (e.g. «g type="middot"/>, «g type="dipunct"/>, 
<g type="tripunct"/>). In Leiden+, the so-called ‘g-types’ are encoded by typing 
the attribute name between two asterisks. This is usually how other lectional signs 
(apostrophe, diastole, stigmai) and all graphical devices (check marks, deletion marks, 
parentheses, line fillers, strokes) work in this markup: a full list of what is supported by 
Papyri.info can be found at http://147.142.225.252/paptrac/ wiki/gtypes, while new signs 
are being developed specifically for the DCLP.'6 

A small set of other signs, separating not letters or words but entire text sections, 
is encoded as ‘milestones’:’’ this is the case with paragraphos («milestone 


113 GENETTE 1992, 83, then GENETTE 1997, 1. 

114 Cf. GENETTE 1992, 83-4, later developed in GENETTE 1997, 1-7. 

115 On punctuation in the papyri cf. the overviews by TURNER 1987, 7-10, and CRIBIORE 1996, 81-3. 
For specific issues cf. DEL MAsTRO 2017 (Herculaneum papyri) and FUNARI 2017 (historical fragments). 
For the particular care for paratext in the digital editions of literary papyri see the notes by R. Ast and 
H. Essler in this volume. 

116 Onancient punctuation and encoding/annotating issues see the article by G. Celano in this vol- 
ume. On filling marks in the papyri cf. BARBIS LUPI 1992. On diacritical and lectional signs see now 
also NODAR DOMÍNGUEZ 2017 and MCNAMEE 2017. 

117 Cf. http://www.stoa.org/epidoc/gl/latest/trans-textpart.html for the difference between Divs 
(structural text parts) and milestones (non-structural text parts). A possible issue is that in fact e.g. 
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rend="paragraphos" unit="undefined"/>), which has been supported by 
Papyri.info since the beginnings, and with the new additions developed for DCLP, i.e. 
koronis («milestone rend="coronis" unit="undefined"/>) and forked 
paragraphos a.k.a. diple obelismene (<milestone rend="diple-obelismene" 
unit-"undefined" /»).'5 They are all placed between any two lines of text and in 
Leiden+ are encoded as rows of four typographical characters depending on the sign 


tions of dash + angle bracket for diple obelismene: ->->->->). 

The diple has been categorized as ‘milestone’ as well (<milestone rend="di- 
ple" unit="undefined"/>, Leiden+: >>>>), though this may create some is- 
sues, since diplai are frequently used in the margins to highlight a specific section, 
phrase or word, and encoding them as milestones would not be semantically cor- 
rect.!? A similar issue may arise when the paragraphos is used between two lines to 
separate two sections of a text but the logical division occurs within the preceding or 
the following line, as in PSI VI 718 = SB XXVI 16458 (IV AD, http://lit- 
pap.info/dclp/64564). This sheet, likely cut off from a small parchment notebook, 
contains part of a collection of prescriptions separated from each other by inline fill- 
ing marks and interlinear paragraphoi. The last recipe starts in 1. 12, following the end 
of the preceding one and after a separator mark, though the paragraphos is traced 
between ll. 12 and 13. 


10 | 865 éxAiwetv kai TO 
Tré(trept) ef Te &ocovos u[n] 


AaAcv / (*) campóv ofi-] 


vov Troinde(*) KaAdv [ -ca.?- ] 


Fig. 5: SB XXVI 16458,10-13 


«lb n="12"/>\AaXév «g type="check"/></lem><rdg><choice><corr><expan> 
mé<ex>m</ex>e<ex>ol</ex></expan>itéa<supplied reason="omitted">c¢ 
</supplied> qoAoiócg«/corr»«sic»ng£ei(reoQovocg«/sic»«/choice» y«supplied 
reason="lost">a</supplied><lb n-2"12" break="no"/>Aax@v  «expan»«ex» 


paragraphoi may actually mark a subdivision between structural text parts, but the underlying ra- 
tionale in the papyrological markup seems to be that they represent graphical separators or turning 
points marked by the ancient scribe. 

118 On these peculiar signs cf. BARBIS LUPI 1994 (paragraphos), BARBIS LUPI 1988 (diple obelismene), 
SCHIRONI 2010, 16-18 and passim (koronis). 

119 A «ni» tag would probably be better: see below the case of eisthesis/ekthesis. 
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dpaxpac</ex></expan> «num value="6">ç</num>.</rdg></app> oampdv o 
«supplied reason="lost">Î</supplied> 


<milestone rend="paragraphos" unit="undefined"/> 


«lb n="13" break="no"/>vov <choice><reg>notfioalt</reg><orig>nolnoe 
</orig></choice> xKaddv «gap reason-"lost" extent="unknown" unit= 
"character"/> 


In such cases, the sign is reproduced as in the original text, but the semantics is odd; 
moreover, word breaks between two lines separated by a paragraphos seem not to be 
handled by the searching engine. It is quite clear that rigid textual units lose rele- 
vance when one deals especially with technical texts, and a separate layer to record 
the paratext in all its multifarious relations with the text may prove useful.” 

Blank spaces are a particular category of paratextual devices that deserves a thor- 
ough reflection. If the main purpose of punctuation is to divide text portions, then it 
is possible to think that any “space deliberately left blank [inside a text] is also to be 
considered as a mode of punctuation”.! The traditional way of referring to deliberate 
blank spaces is the vacat, which is rendered with a «space» tag in TEI/EpiDoc (with 
the very same attributes as the <gap> mentioned above). Of course, encoding recur- 
ring blank spaces like those deployed in P.Col. IV 122 (official letter, 181 BC, http://pa- 
pyri.info/ddbdp/p.col;4;122) to separate almost every word from one another would 
be impossible, if not in a different layer than text. Normally, the vacat is to be marked 
up when it introduces a significant break in the text.” Peculiar cases as in P.Oslo III 
72,9 (medical treatise about epilepsy, II AD, http://litpap.info/dclp/63583), where 
(according to the editors' interpretation) the ancient scribe left a blank gap to pin- 
point a controversial point, should be further (or differently) annotated in order to 
preserve the original intentions.” 

The handling of blanks is connected to the problem of how to encode ekthesis and 
eisthesis, i.e. extension and indention of a line with the purpose of highlight particular 
phrases, which has never been taken into consideration before for documentary papyri. 
Among the medical texts, this device is frequently deployed in the questionnaires or 
catechisms. Such a text typology provides medical notions in a dialogue format, 
where a question about theoretical definitions or practical procedures is followed by 


120 Diacritics and lectional signs added by different hands are another case of uneasy elaboration. 
121 TURNER 1987, 8; cf. also CRIBIORE 1996, 83 (*Blank spaces can be used as punctuation"). 

122 Vacat can be used also to render columnation in particular layouts (lists, accounts, etc.), but the 
use is not standardized. See the chapter by L. Berkes in this volume for some remarks on the markup 
oflayout in documentary papyri. Very recently, DICKEY 2017 has dealt with particular layouts of bilin- 
gual texts, where the columns are handled with blank spaces. 

123 This case is currently encoded as an editorial apparatus note displaying the omitted text. 
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a more or less detailed answer.™ Its use as a handbook, a reference tool for the doc- 
tors’ preparation, is clear also from the complex set of devices employed to highlight 
the articulation of the text: questions are very often indented in eisthesis, and further 
marked with paragraphoi, line fillers or other lectional marks that introduce the an- 
swers as well. This mise en page reflects the central role played by the question-and- 
answer structure of the didactical tool,” and must be preserved when the texts are 
moved to any modern format. This is not only a matter of reproduction. In the overall 
framework of difficulty of recognizing textual genres! due to the fragmentary state 
of the scattered sources preserved to us, scholarship relies on any possible feature for 
a better understanding of ancient texts, and some very fragmentary texts have been 
identified as questionnaires just on the basis of the presence of blank spaces (P.Ox- 
ford Sackler s.n., II century BC; more recently GMP I 6 and P.Strasb. inv. 849):!8 it 
is therefore unconceivable to encode such texts without paying attention to their par- 
atextual garment. 

It is tempting, at a first stage, to equate an eisthesis to an initial vacat and there- 
fore to encode it like that. However, as we have to encode not the visual appearance 
of the text but its semantic core, we must be aware of the fact that we are not describ- 
ing a certain extent of space intentionally left without characters, but a displacement 
of the line beginning to stress its relevance.’” Its specular counterpart, ekthesis, 
makes the picture clearer: by no means can it be indicated by creating weird virtual 
vacats at the beginning of the surrounding lines. The current solution is to mark it as 
an attribute of the line: «1b n="1" rend-"indent"/», which in Leiden+ appears 
as (1, indent) - the same way marginal annotations are tagged (see below; for 
ekthesis the value "outdent" is to be used).?? This seems to work fine, and is now 
fully supported by the DCLP platform also in terms of visual display. 


124 Cf. REGGIANI 2016b with earlier bibliography; also BONATI 2018e. 

125 Cf. ANDORLINI 1999; REGGIANI 2018h. 

126 Cf. ANDORLINI 1997b, 159, and see above. 

127 Cf. BARNS 1949, 4-5. Online: http://litpap.info/dclp/65633. 

128 Cf. HANSON — MATTERN 2001, 72 and MAGDELAINE 2004, 63, respectively. Online: http://lit- 
pap.info/dclp/69007 and http://litpap.info/dclp/69028. 

129 Onthe ecdotic relevance of line displacement in the system of the margins of the Greek literary 
papyri see SAVIGNAGO 2008 (cf. also TURNER 1987, 8). 

130 In medical papyri ekthesis issomehow less frequent than eisthesis; a significant case is presented 
by CORAZZA 2018a. See also P.Oxy. XIX 2221r + P.Köln V 206r (http://litpap.info/dclp/61916), the afore- 
mentioned commentary to Nicander’s Theriaka, where the lemmas containing the commented pas- 
sages are highlighted by ekthesis (cf. ANDORLINI 2000, 39) and the comments are introduced by larger 
blank spaces, which might be considered as eistheseis. 
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1 [-ca.?-]Traces ca. 3 characters[ -ca.?- | CCIE PE JE YN Ds ESA 
[ -ca.?- ]vurroettak[ -ca.?- ] -ca.?- ] Traces ca. 3 characters[ -ca.?- ] 
[ -ca.?- ] - ca.9 - [ -ca.?- ] -ca.?- ]vurroettak[ -ca.?- ] 
Tí gotw x[orri;] -ca.?- ] - ca.9 - [ -ca.?- ] 

[ñ Tv] ewpé&ruov Touń. ti gotw k[orrfi;] 

Tí éotw &rrobopá; Lo ñ TV] cwpaTwv TOUT. 

[ñ 6U& Tóvwv Kai Guévw(v) ti got kTro5op&; 
[owp]éTwv SiaoTactc. ñ 8U& Tévwv Kai ópévox(v) 
ti got StaKévtTn- owp)atwv SiaoTaor. 
otc; ti šoTuv Btakévrn- 
[oTe]vm(*) Stà B£Xóvnç roov OG; 

[o]wyatwv Touń. ore]vn(*) Stà BeAOvng Tv 


ti gore Siapapy;(*) o]wpatwv Toun. 


TI got Stapapn;(*) 
5]takévrnoto tà BeAó- 
v]nc kal b&pporoc ñ pi- 
tov Stapepopévou KATÈ 
Ti]oAA&c £rreiBouA&c.(*) 

rróca([t] Sta~opai otAnviw(v); 
T]& r&v omànvíwv tapé- 
plovor B- ñ(*) Ta\pà/ thy “Anv, 
ñ kai](*) [mapà ró oxfpa. 

[ -ca.?- ](*) thv ÓAnv; 


[B]vaxévtnorg rà BeAÓ- 

[v] nç kat p&uuatog ñ pi- 

[r]ou S:apepouévou kata 
[m]oXAkg ErrerBouAde.(*) _ 
róca[u Suaq@opai ottAnviwlv); 
[T]& Twv omànviwv rapé- 
[pJoucx B- ñ(*) ma\pà/ THY U(MAnv, 
[ñ «ai](*) [rr]ap& TO oxñpa. 

[ -ca.?- }(*) tiv BAny; 


Fig. 8a: visualizzazione del testo nel PE) (Fig. 8b: visualizzazione del testo in DCLP) 


Fig. 6: A nice comparison between the earlier SoSOL preview display and the current DCLP rendering 
of eisthesis in P.Gen. inv. 111, catechism, http://litpap.info/dclp/63819 (BERTONAZZI 2018a, 42). 


However, a further problem arises if we consider that in some catechisms the ques- 
tions do not start in a new line, but on the same line as the end of the previous an- 
swers, after a blank space that cannot be considered as a vacat for the same reasons 
as above. In this case, if we tagged the entire line as in eisthesis we would not repre- 
sent the situation correctly, since the first part of the line is not really indented. A new 
solution might be to tag the question phrase with the TEI/EpiDoc XML «nhi > element, 
which is used to sign “highlighted characters or words", “with a rend attribute spec- 
ifying the kind of highlighting",?! In our case, the value of the rend attribute would 
be"eisthesis",ie.«hi rend="eisthesis"></hi> (or "ekthesis" in the 
other case), which is not supported by SoSOL currently.'? 


131 Cf.http://www.stoa.org/epidoc/gl/latest/trans-charactershighlighted.html. 

132 Cf. REGGIANI 2018h, where I advanced a further distinction of eistheseis according to their appear- 
ance in the texts. It is worth noting that encoding eisthesis/ekthesis as a «hi» element would prove 
helpful also when handling whole indented/outdented paragraphs (see the sample in the Conclu- 
sions below), instead of marking each line. 
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The <hi> tag handles also the markup of ancient diacritical signs originally 
added by the scribe,'? which has always been well developed since the earlier times 
of Papyri.info. The canonical cases are accents (acute, circumflex), spirits (lenis, as- 
per), diaeresis, either alone or in combination, marked with a rend attribute, the 
value of which corresponds to the name of the sign. Leiden+ markup is rather intri- 
cate (they must be added in the proper Unicode character inside a pair of brackets 
just after the appropriate letter, which in turn must be always preceded by an extra 
space, in whichever position it occurs in the word), but fortunately the editorial plat- 
form offers quite a helpful menu to automatically perform the task. It is worth noting 
that the presence of ancient diacriticals is noted in the apparatus. 

The occurrence of images within the text is currently handled as well. The <fig- 
ure» tag is used, and a free description of the picture can be inserted in a nested 
«figDesc» tag; Leiden+ simply indicates it with the free description preceded by a 
hash mark. In medical papyri this proves quite useful when dealing with the cases of 
illustrated herbals, where the extant images can be easily encoded with #plant (= 
<figure><figDesc>plant</figDesc></figure>).! 


4.8 Intertextuality & hypotextuality 


Intertextuality is defined as the relation between parallel text, in the form e.g. of quo- 
tation or allusion;'5 hypotextuality (with its opposite, hypertextuality) as the relation 
between a text and a preceding one that is transformed, modified, elaborated or ex- 
tended. Due to the high degree of both theoretical and practical re-elaboration of 
medicine — “reperformance”, in a sense, to borrow a term created to describe the in- 
terplay between text transmission and representation in classical drama? —, medical 
papyri show a complex degree of both inter- and hypotextuality. Not only are the 
‘classical’ medical treatises and handbooks copied following the original text, but 
they are also quoted, or referred, or re-elaborated in other writings" (anonymous 
treatises as well as manuals, catechisms or collections of prescriptions, and of course 
commentaries) and excerpted by the late compendiasts (Oribasius, Aetius, Paul of 
Aegina), who took and interwove excerpts from the earlier authors in order to create 
composite texts, with the purpose of assembling the best from previous writings."? 


133 On this typology of signs cf. TURNER 1987, 10—12; CRIBIORE 1996, 83-8; COLOMO 2017; AST 2017. 
134 P.Tebt. II 679 + P.Tebt.Tait 39-41 (II AD): http://litpap.info/dclp/63596; P.Johnson + P.Ant. III 
214 (IV-V AD): http://litpap.info/dclp/64598 (cf. REGGIANI 2018i). 

135 Cf. WORTON - STILL 1990; POLACCO 1998; BERNARDELLI 2000; BERNARDELLI 2010, esp. 9-62. 

136 FINGLASS 2015. 

137 On the conceptofintertextuality applied to ancient quotations cf. BERTI 2012, part. 439—46, about 
ancient historians. 

138 Cf. HANSON 1997,296; ANDORLINI 1997a, 19—20. 
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The interconnection between all such parallel or derived texts is of the utmost 
importance for evaluating the history of medical science, the dynamics of ancient tex- 
tual transmission, and the framework of literacy among medical experts, so that an 
annotation layer that may link the actual text on the papyrus to any relevant related 
passage in other sources would be most useful.’” In the cases of papyri preserving 
‘literary’ works (Hippocrates, Galen, etc.), for example, our fragments quite often do 
provide more genuine readings than manuscript tradition, since they are chronolog- 
ically closer to the source;!° they can therefore support some manuscript versions 
against others, or even preserve previously unattested variants, facts that deserve a 
particular attention. 

A small selection of significant samples will suffice. In P.Oxy. XIX 2221r + P.Kóln 
V 206r (http://litpap.info/dclp/61916), the abovementioned I-century AD commen- 
tary to Nicander’s Theriaka, the extant quoted passages generally agree with the more 
recent manuscripts of Nicander’s tradition (= w) against the ancient codex Parisinus 
II, and show also new genuine variants.’ The comments, in turn, do not show many 
points in common with the known scholiastic tradition, and may be traced back to 
the most ancient comment to Nicander’s Theriaka, that by Demetrius Chlorus.! In 
the Aberdeen Hippocratic papyrus (GMP I 1), already cited with regards to the adap- 
tations to the Greek language spoken in Egypt, we do find variants already attested 
in the manuscript tradition (11. 4-5) but also passages completely divergent from the 
codices (ll. 11-12, where the length of the gap and the shape of the following traces 
exclude the unanimous manuscript tradition, which is of course printed in all the edi- 
tions, in favour of a previously unattested variant). 

Alignment among parallel versions of the same text, by linking external resources 
providing canonical literature,“ can therefore convey precious information and sig- 
nificant analysis tools, and can be well extended to all the cases (even documentary 


139 So far, this has been possible only in the line-by-line commentary: cf. BERTONAZZI 2018b and Co- 
RAZZA 2018a for discussion and case studies. 

140 Cf. ANDORLINI 1997a, 22 n. 15 and 23. 

141 E.g. ii,12 rÀégt Óykoc vs méAEt Óykoq w & Gow-Scholfield, méAet óAóg II & O. Schneider (Nic. Ther. 
387). According to COLONNA 1954, the original text could have been mAget OAK6G, subsequently popu- 
larized in néAet dAkég and glossed with Óykoc. 

142 Full analysis in COLONNA 1954. 

143 Cf. ANDORLINI 2001. For another Hippocratic papyrus preserving an interesting and complex tex- 
tual history (P.Ant. III 184, VI AD, http://litpap.info/dclp/60192) cf. HANSON 1970; in particular, "the 
sequences of the Hippocratic texts do not correspond to the one established in medieval tradition but 
seem to follow autonomous criteria" (CORAZZA 2018b, 174). 

144 While digital repositories of literary texts do exist, they usually do not record all manuscript var- 
iants of the texts (see above); they could well be connected to the papyrus texts but information would 
be partial. A possible solution may be to create multitextual digital editions of the literary texts. It is 
important, of course, to distinguish parallel passages in copies of the same text from quotations em- 
bedded in different texts; for the latter, the current platform offers the possibility to deploy the <q> 
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ones) of texts preserved in more than one item (copies, duplicates). It would also 
solve the problem of encoding philological variants and manuscript readings in the 
papyrological digital editions, a challenge faced during the construction of the DCLP 
database and not yet satisfactorily solved. Alignment of ancient or modern transla- 
tions of the medical texts (e.g. Latin or Arabic versions) should also be taken into con- 
sideration,“ while a fruitful integration between syntactic annotation/analysis and 
intertextual referrals may be envisioned for the most intriguing issues of medical lit- 
erature, as recently argued by Francesca Bertonazzi: 


l'analisi del lessico tecnico dei papiri chirurgici ha portato a individuare paralleli testuali tra 
testo tramandato su papiro e tradizione manoscritta, talvolta significativamente stringenti come 
nel caso di P.Strasb. inv. 1187 e diversi passi di Eliodoro ap. Oribasio. Alcuni altri papiri 
(P.Lond.Lit. 166, P.Gen. inv. 111, P.Fuad.Univ. 1, P.Ryl. 3.529), come già notato dagli studiosi, 
sono caratterizzati da una forte presenza di ‘lessico eliodoreo' e da alcune peculiarità proprie del 
modus operandi del chirurgo, come la predilezione di interventi chirurgici che siano il piü sicuri 


tag marking quoted phrases (Leiden+: quotation marks + space), which can be easily used to differ- 
entiate the appropriate cases. 

145 Traditionally, documentary papyri preserving the ‘same’ text in multiple copies (for a catalogue 
of duplicates cf. NIELSEN 2000) are treated in the ‘philological’ way, i.e. collated and merged in one 
source archetype: e.g. http://papyri.info/ddbdp/p.tebt;3.1;771dupl (note the suffix ‘dup!’ added to the 
URL of the digital text, which advises about the existence of a duplicate of the papyrus). However, a 
certain degree of uneasiness is felt about such a practice , see e.g. in Jelle Stoop's words: “I disagree 
with this editorial choice for two reasons. First, in a field like papyrology, every copy of a text deserves 
full consideration and [...] an archetype that would somehow be considered more authentic than a 
later copy is an editorial fancy. Copies of the same text, however similar, were written with a purpose 
in mind, so that edition should be more rather than less interesting. Second, in order to appreciate 
the fact that we have multiple copies [...], we must ask why different versions of it exist in the first 
place. The interest of these documents is, therefore, not restricted to the text alone, but extends to the 
life and afterlife of its copies in relation to one another. In sum, the text of just one fragment does not 
make for a satisfactory edition of understanding of this [text]. By editing the texts in their own right, 
we learn about the convention of [...] writing in [Graeco-Roman] Egypt” (STOOP 2014, 185). A new ‘phe- 
nomenological’ consideration of papyrus copies is emerging (cf. YUEN-COLLINGRIDGE — CHOAT 2012, 
with interesting preliminary comments on textual differences between copies of the same document), 
but, for now, the digital database is following the ‘philological’ practice, with a significant loss of 
information. Giuditta Mirizio (Bologna) is currently working on this topic also from the perspective of 
digital encoding and XML annotation. On this topic, see also below. 

146 The proposed tag («app type="variant">, Leiden+ |var|) raised some theoretical and 
methodological issues, for example whether to choose just one manuscript variant or to encode all 
possible instances. Moreover, the «app» tag (see below) typically envisages a “lemma” («1em») part, 
which corresponds to the word(s) in the text, and one or more “reading” part(s) («rdg»), correspond- 
ing to the alternative(s) in the apparatus, and it must be clearly thought how this should work in the 
case of philological variants. Currently, the Digital Corpus of the Greek Medical Papyri has adopted 
the solution to just describe the most relevant manuscript variants in the line-by-line commentary. 
147 On translations in the tradition of ancient Greek medical texts see e.g. GAROFALO — FORTUNA — 
LAMI — ROSELLI 2010. 


38 — Nicola Reggiani 


possibili per il paziente, nonché del modus scribendi, come il ricorso frequente alla prima per- 
sona — singolare o plurale -, la definizione con esattezza delle posizioni ‘topografiche’ della parte 
operata (dentro, fuori, sopra, sotto), e una sostanziale semplicità delle strutture sintattiche usate. 
Adoggi, i tentativi di attribuire i papiri citati alla paternità di Eliodoro si sono basati quasi esclu- 
sivamente su criteri lessicali nel confronto tra il testo tramandato su papiro e sui capitoli di Ori- 
basio che portano la titolatura ‘da Eliodoro’. Una nuova possibile strada offerta dalle nuove tec- 
nologie della papirologia digitale è quella costituita dall'annotazione sintattica dei testi: 
un’analisi più accurata non solo del lessico, che come è noto è la parte più ‘volatile’ della lingua, 
ma delle strutture morfologiche e sintattiche dei passi del compilatore tardo in sinossi con i testi 
dei papiri, sia pure nella limitatezza delle pericopi testuali preservate, potrebbe gettare nuova 
luce anche su questo aspetto tra i più incerti quanto stimolanti della ricerca.!48 


Re-elaboration is probably the most striking feature of technical texts, stemming from 
oral teaching and then continuously adapting their content according to the develop- 
ments of knowledge. Medical genres like the questionnaire or the collection of pre- 
scriptions illustrate this framework at the best, though we do find plenty of cross- 
references in treatises too.’ Catechisms (erotapokriseis), for example, are clearly de- 
rived from and devoted to some sort of oral teaching, as we pointed out above while 
discussing of their paratextual devices. Yet there exists a considerable similarity with 
the literary genre of the “definitions”, connected with the research and teaching prac- 
tice of Hellenic medicine and attested in the Greek Pseudo-Galenian treatise Horoi or 
Definitiones Medicae (XIX 346-462 Kühn) and in the Latin Pseudo-Soranian Quaes- 
tiones medicinales.™ In fact, David Leith has recently distinguished two types of ques- 
tion-and-answer medical texts: the proper catechisms, being introductory manuals 
for the student of medicine, and wider treatises on remedies. The suggestion came 
from the similarities detected between erotapokriseis on papyrus like P.Turner 14 
(http://litpap.info/dclp/63560) and PSI inv. 3783 (http://litpap.info/dclp/63244) and 
the excerpts from the physicians Herodotus and Antyllus preserved in Oribasius’ 
Collectiones Medicae.™ One may also recall the similarities between the abovemen- 
tioned P.Oslo and P.Oxy. overlapping questionnaires, or between the surgical cate- 
chism P.Gen. inv. 111 (http://litpap.info/dclp/63819) and the treatise known as Cirur- 
gia Heliodori,^ or also between P.Aberd. 11 (http://litpap.info/dclp/63332) and 


148 BERTONAZZI 2018a, 242-3. 

149 Cf. e.g. ANDORLINI 2014. “La possibilita di identificare alcuni papiri con trattazioni di un autore 
tramandato solo indirettamente inserisce tasselli nuovi nella complessa stratificazione della trasmis- 
sione indiretta, soprattutto quando sono i papiri i soli testimoni diretti di autori tramandatici per ex- 
cerpta e citazioni (Apollonius Mys, Heras, Heliodorus, Herodotus Medicus)” (ANDORLINI 1997a, 22). 
150 On which cf. KOLLESCH 1963 and FISCHER 1998 respectively. For general considerations about cat- 
echisms on papyrus see also BONATI 2018e and BERTONAZZI 2018a, 57-62 , as well as REGGIANI 2016b. 
151 LEITH 2007; cf. already ANDORLINI 1997b, 160. 

152 Cf. MARGANNE 1986 and now BERTONAZZI 2018a, 237-8. 
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P.Ross.Georg. I 20 (http://litpap.info/dclp/63569), two ophthalmological catechisms 
that certainly derive — with variations — from the same source.!? 

Prescriptions are even more complex (and fluid) in transtextual and hypotextual 
relations. I have extensively dealt with transmission of ancient medical recipes else- 
where, where I outlined the articulated route from oral compositions and draft tran- 
scriptions to professional exchange and collection.^* Medical prescriptions are frag- 
mentary units, which stem from diagnostic-therapeutic practices and oral knowledge 
that are recorded on wax tablets (pinakes), first kept at the sanctuaries of the healing 
gods, then collected by leading physicians (namely Hippocrates) in order to build sys- 
tematic medical repertories.'° At this stage it is hard to trace any actual intertextual 
relation, but when - seemingly in the early Roman age - prescriptions start circulat- 
ing among the physicians, the plot gets intricate. Professional doctors exchange sin- 
gle recipes on papyrus scraps with each other and collect those fragments of medical 
knowledge into lists and catalogues on parchment booklets, deploying a set of par- 
atextual devices to preserve the unity of each prescriptive text. Galen is the best wit- 
ness to this ‘research’ activity,’ as well as of the ‘philological’ attention to the phar- 
macological books held by the libraries, which he himself consulted and collated to 
get the most exact versions of the texts and to compile his famous treatises on the 
composition of remedies.” 

This workflow is by no means exhausted with Galen: among the numerous pos- 
sible examples, P.Berl.Móller 13 (http://litpap.info/dclp/64268) is a stunning in- 
stance. This papyrus, a comparatively large portion of a roll from Hermoupolis 
Magna, dated between the late III and the early IV century AD, is written on the recto 


153 Cf. BERTONAZZI 2018a, 236-7, with earlier bibliography. 

154 REGGIANI 2018g and 2018j. 

155 Cf. TOTELIN 2009a, part. chapters 1-3. 

156 Comp.med.loc.11 = XII 423,13-15 K. (a recipe is found in a dead physician’s parchment notebook 
and then forwarded to Galen); Antid. 15 = XIV 31,10-15 K. (exchange of recipes); Indol. 33-5 (his own 
personal collection of worldwide prescriptions, destroyed by the AD 191 fire). See also P.Mert. I 12,13- 
24, attesting to the very same activity of exchange between two colleague physicians in Egypt. 

157 Theancient practice of collating several copies (antigrapha) of medical texts is attested above all 
by Galen, who noted several degrees of manuscript divergences, ranging from small linguistic varia- 
tions to major discrepancies in the content, e.g. in the ingredients and quantities (cf. ANDORLINI 2000, 
38-9; ANDORLINI 2003, 14—15; TOTELIN 2009b; BONATI 2016b, 64—5), but we know of other cases in 
which the ancient readers produced ‘personal’ copies that became, by means of reformulations and 
abbreviations, new recensions of the same text (cf. ANDORLINI 2000, 37-8). In some cases it is possible 
to speak of erroneous or inaccurate deviations from the original (it is the case with Galen's treatises, 
for which the ancient author himself stigmatised the circulation of incorrect versions of his own 
books: De libris propriis II 91-3 Müller = XIX 8-11 K.; cf. HANSON 1985, 43—5) but in other cases it is 
difficult to go back to a genuine text (HANSON 1985, 34—5 makes the example of Hippocratic letters). In 
general, on Galen’s ‘philological’ work cf. HANSON 1998; ANDORLINI 2003, 15-16; DORANDI 2014; BONATI 
2016b, 63-5. 
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along the fibres, therefore purposely produced as a collection of medical prescrip- 
tions, of which only two columns survive. The first one contains a single prescription 
“to prevent hair loss on the head”, identified by MARGANNE 1980 as a prescription as- 
cribed by Galen to Heras of Cappadocia, a pharmacologist active between 20 BC and 
AD 20. The text on the papyrus parallels Gal. Comp.med. loc. XII 430,8-15 K. verba- 
tim,” while other variant versions of the same remedy are recorded by Galen himself 
(ibid. XII 435-6 K.) as antecedents of Heras’ one.’ Subsequently, CORAZZA 2016 dis- 
covered that also some remnants of the second column can be identified with other 
recipes by Heras, this time against headache, mentioned by Galen as well, with some 
wording variants.’ Two of them patently parallel Galen, but the papyrus is by no 
means a copy of On the composition of medicaments by places: the recipes do not fol- 
low the canonical order in which they are cited in Galen’s treatise, clearly attesting a 
work of selection, extraction, and thematic re-arrangement, in which each recipe is 
treated as a unit to be managed on its own; moreover, the other two identified pre- 
scriptions look like variants of Heras’ texts as reported by Galen, thus attesting a 
‘fluid’ stage of transmission, in which recipes are modified and adapted according to 
the users (Galen himself, as we saw, attests some earlier versions of Heras’ recipe 
against hair loss). It is apparent that this interconnection of “living texts”,! copied 
and re-copied from original pieces or different collections, generates cross-references 
and inter-quotations that may well fall into the cases described in these paragraphs.” 


4.9 Metatextuality 


Metatextuality is the explicit or implicit critical commentary of one text on another 
text. For the same reasons described above apropos of paratext and intertextuality, 
namely the fluidity of medical technical texts, always subject to renovation and up- 


158 In fact there are some interesting variants, which as usual show how papyri can contribute to 
the history of the texts: in particular, at line 10 (kaAobot pap. : KaAovot Kai Gal.) the papyrus offers a 
superior reading, since the conjunction is syntactically unfit; further discussion in CORAZZA 2016 ad 
locc. On the value of the variants attested in the papyri see above. 

159 Cf. MARGANNE 1980, 182-3. 

160 In particular, the first prescription of the second column (ll. 1-3) parallels Gal. Comp.med.loc. XII 
593,14 K. verbatim, while the following two (11. 4-8 and 9-15) show partial overlaps with ibid. XII 
594,1-4 (= Aet. VI 50,75-9) and XII 594,7 ff. K. All these recipes are ascribed to Heras. The remaining 
traces of fifteen lines, articulated in four more recipes, could not be identified with any known text. 
161 BONATI 2016b, 66. 

162 The Leiden+ tag for parallel passages is meant to mark omitted text that is supplied on the 
ground of parallels, so it does fall into a different typology (see below, editorial interventions). 
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date after practical and individual experience, the practice of annotating is wide- 
spread in the medical papyri.! The annotations can acquire either the organic format 
ofthe commentaries, autonomous exegetic treatises, the most illustrious examples of 
which are Galen's commentaries on Hippocratic texts!% (see e.g. Gal. In Hp. Epid. II 4 
= CMG V 10,2,1, p. 78,7-11 Wenkeback, where the author himself explains some rea- 
sons for compiling commentaries), or the scattered aspect of the scholia or marginal 
annotations: see the examples of P.Ant. 128 (http://litpap.info/dclp/60189), fragment 
of a III-century parchment codex from Antinoupolis with the text of Hippocrates' 
Aphorisms and marginalia, as well as of P.Ant. III 186 (http://www.litpap.info/ 
dclp/59961), a very fragmentary large-format papyrus codex from the same place, 
dated to the VI cent. AD, which contains sections of Galen's De compositione medica- 
mentorum per genera along with some scanty marginal annotations.!9 

In both cases, textual relations are complex. Commentaries refer to other texts 
but without exact parallels, except for literal quotations (see above); marginal notes 
refer to the main text without being part of it, so that the current treatment in digital 
editions may be slightly misleading, since it allows for marking the marginality of the 
passage (added to or written into the margins), but not the type of relation with the 
main body of the text. The XML syntax is clear: marginalia are encoded as plain text 
lines, with the indication of the margin attached to the line number.'? Let us consider, 
instead, a more complex case, represented — again from Antinoupolis — by P.Ant. III 
126 (http://litpap.info/dclp/65233): 


P.Ant. 3.126 (VI-VII secolo d.C.) é parte di un compendio sul trattamento farmacologico e chirur- 
gico della tonsillite e rappresenta un esempio di ‘enciclopedia medica’ redatta in epoca bizan- 
tina; il ritrovamento di testi come questo conferma l'idea che nella pratica medica antica la tra- 
smissione del sapere avvenisse tramite la combinazione di fonti tradizionali, tramandate per 
tradizione scritta, e di materiale desunto dalla pratica medica quotidiana e registrato proprio 
dagli specialisti che operavano sul campo. 

Il testo principale, ovvero quello scritto in carattere piü grande nella parte piü estesa di papiro, 
ë arricchito da annotazioni nel margine inferiore che riguardano alcune terapie farmacologiche 
da impiegare nel caso dell'insorgere delle patologie descritte nel testo, e tale modalità di uso e 


163 On the practice of annotating medical treatises with scholia and comments cf. ANDORLINI 2003; 
in general, on scholia and commentaries in the papyri cf. MESSERI SAVORELLI — PINTAUDI 2002.. 

164 Cf. MANETTI — ROSELLI 1994. 

165 cf. ANDORLINI 2000, 41-2; ANDORLINI 2003, 19-24 

166 Cf. CORAZZA 2018a. 

167 The two cases are tightly related, and can even merge together in the so-called “commented edi- 
tions" discussed by VANNINI 2015. 

168 See the observations by CORAZZA 2018a. On the interactions between text and glosses, very inter- 
esting is the analysis by MANIACI 2002, though referred to later types of texts. 

169 E.g.<lb n-"1,minf"/» for lines in the bottom margin (the other margins are indicated with 
msup, ms, md). In Leiden+ this information is added to the line number accordingly. 
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riuso del testo testimonia l’iter con cui il sapere tradizionale era compendiato, arricchito e inte- 
grato nei libri tecnici dai possessori dei testi. Le caratteristiche di layout, la consistenza dei mar- 
gini (quello inferiore, quasi totalmente conservato, misura 5 cm) e la scrittura regolare, oltre 
all’indicazione in alcuni punti degli spiriti, lasciano pensare che il frammento fosse parte di un 
codice di notevoli dimensioni e, dunque, di un certo pregio; il tipo di annotazioni riportate nel 
margine, anche in mancanza di notizie più specifiche circa l’uso di questo codice, fanno pensare 
che il redattore potesse essere un medico piuttosto competente o un soggetto forse ancora in 
formazione ma abituato alla pratica medica."? 


The relation between the marginalia and the text is tight, though the current markup 
can be arranged just as follows: 


P.Ant. 3.126, Fr. A 


13,minf. [a]Cuuou z .1 [.4]Awv rou Enpiou [.15] [e]rruxee Bepuov 
14,minf. [vJ5wo kar AeuorpiBe] koc pera .1 [.15] [e]Aatou kaTarrAacce 


15,minf. [.4]aprou .1 [.7].2 ovome .3 [.?] [mA]aoowwyv To kata 
16,minf.- [m\aoya] [.?] 


Fig. 7: Sample markup of marginalia according to the current standards (from BERTONAZZI 2018a, 73). 


Itis clear that we are not dealing with simple additions to the text, which are easily 
encoded with the «add» tag, further specified — with the place attribute — according 
to the position of the insertion (above, below, left, right, interlinear)."! Scribal addi- 
tions can be effectively utilized under particular circumstances, as in the case of 
P.Oxy. IX 1184v (http://litpap.info/dclp/60175), a I-century AD fragment exhibiting 
part of a collection of Hippocratic epistles likely arranged by theme (the extant texts 
deal with Hippocrates' invitation to Persia by the Great King, which he self-confi- 
dently refuses).!? The papyrus contains different versions of the Pseudo-Hippocratic 
letters 3, 4, 4a, 5, and 6a (ed. Smith), separated by initial ektheseis, and paragraphoi 
between each other.!? Ep. 3 is shortened at the end, and its ‘canonical’ conclusion has 


170 BERTONAZZI 2018a, 53-4; cf. also ibid., 73 for its digitization; CORAZZA 2018b, 46-57; on the anno- 
tations, MCNAMEE 2007, 463 ff. 

171 Forthe cases of scribal additions, Leiden+ recovers some traditional Leiden conventions, so that 
supralinear insertions are encoded between two slashes ( x/) and infralinear insertions with re- 
versed slashes (/ /xN N). The other types of additions are rendered as | |left:x| |, I | right:xll, 
| |interlin:x| |. 

172 Cf. BRODERSEN 1994, 103-7. 

173 The papyrus presents also interesting cases of intertextuality (see above): of Ep. 5, it transmits 
the shorter form with certain variations, while Ep. 6a, a letter to Gorgias previously unattested, has 
striking coincidences of phraseology with ‘canonical’ Ep. 6, addressed to Demetrius. 
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been appended as a supralinear insertion flowing into the right-hand margin - this is 
easily encodable with a combination of the two relevant <add> tags. But then Ep. 4 
was transcribed twice, in an abridged version in the main text, flanked by a shorter 
form without the introductory salutation (Ep. 4a), added into the right-hand margin 
and separated from the main body of letter 4 with an irregular vettical line. Further be- 
low, between letters 4 and 5 (ll. 17-19), three lines of comment appear, unattested else- 
where. 

Marginal or interlinear additions merge with comments in a complex metatextual 
net that sometimes overflows into the text itself"^ and show a remarkable ‘philologi- 
cal’ care for the text by the ancient scribes. The case of Hippocrates’ fourth letter, de- 
scribed just above, is rather meaningful: the marginal text is not a comment (like the 
following interlinear insertion) nor an addition (like the preceding supplementary in- 
sertion), it is an alternative parallel version, a proper variant of the text presented in 
the main body of the papyrus. In this case, the vertical, irregular line traced by the 
ancient scribe to divide the two alternatives acts as a proper indication of a textual 
variant.” We do find even more puzzling instances. P.Tebt. II 272v (http://lit- 
pap.info/dclp/60048, late II cent. AD) is a fragment of Herodotus Medicus’ De 
Remediis, describing the symptomatology of thirst and its treatment; the text corre- 
sponds in part to an excerpt of Herodotus Medicus preserved with Oribasius’ treat- 
ment of thirst in case of fever (Coll.Med. V 30, 6-7 Raeder = CMG VI 1,1). At a certain 
point, where the text reads aitiat tç mpoopopas (1. 5), introducing the different rea- 
sons for giving the sick something to drink, the scribe added two groups of three let- 
ters between dots above the line:"$ *t@v* above rñç, and *p@òv* above pac. This is not 
an addition supra or infra lineam, since it is clearly an alternative to the syntagm be- 
low (plural instead of singular); and since nothing appears deleted, it is not clear if 
the ancient writer wanted to correct the text or just juxtapose two different versions 
of the same passage."" We cannot be sure of what is going on here because this vari- 
ant is unattested in the manuscript tradition, i.e. in Oribasius’ passages quoting He- 
rodotus Medicus, which all feature the singular form. We would have ascribe correcting 
the form unanimously preserved by the manuscript tradition and replacing it with an 


174 Sometimes a marginal annotation can be swallowed up in the main text, generating a textual 
issue that can be likely explained only by means of metatextual correlations: this is the case with 
P.Gen. inv. 111 (http://litpap.info/dclp/63819), where the reading pàppatog ñ pi|[t]ov (11. 15-16), present- 
ing two technical terms that are almost synonyms, may stem from a gloss (cf. BERTONAZZI 2018a, 241-2). 
175 Itis worth noting that similar graphical devices are used by the author of the Anonymus Lon- 
dinensis to frame alternative versions of the same passage (cf. CRIBIORE 2018 and see below). 

176 Ithank very much Todd M. Hickey and Derin McLeod for the help in getting a high-resolution 
picture of the fragment. I mention this case in REGGIANI 2018a, 2018b, 2018c, with discussion of the 
tentative code used to digitize it. 

177 Writing a word between dots could be a way to highlight later corrections, like e.g. the koppa in 
P.Eirene III 25, 3 (III AD; see comment ad loc.). 
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unattested variant. The P.Tebt. editors speak of “correction or alternative reading”, 
M.-H. Marganne of *hésitation";"? if we should define it, we ought to call it a ‘scribal 
variant’, just as in the Hippocratic case presented above, as well as in P.Oxy. LVI 3851 
(http://litpap.info/dclp/61917, II-III AD), a fragment of Nicander’s Theriaka (333-4), 
which at 1. 12 reads npeopiotat[ov] (attested in most of the manuscript tradition) with 
a v added supra lineam between dots, being mpeoBvotatov an alternative version at- 
tested in some of the manuscripts (= Kv). 


4 [tot]¢ rñç Emu66ogoç xpóvovc met- 
- Tov. piv. 
[ou]ç altlar vfic npoogopüç cl(o,v° et 


Fig. 8: P.Tebt. 272,4—5 (courtesy of the Center for the Tebtunis Papyri, University of California, 
Berkeley). 


It is not easy to deal with these cases digitally,!? at least with the currently available 
tools, which deploy instead a full set of tags aimed at encoding plain scribal correc- 
tions, i.e. additions (see above), deletions («de1» tag with rend attribute describing 
the type of deletion: "erasure", "slashes", "cross-strokes"), and replace- 
ments (<subst> tag containing a nested «add place="inline"> tag defining 
the corrected text and an equally nested <del rend="corrected"> defining the 
replaced text).!5! 


178 MARGANNE 1981b, 76. 

179 TOMASI — ZAJA 2002 discuss some interesting solutions for the encoding of marginal writings, 
though dealing with quite later types of texts. 

180 Leiden+ employs the double square brackets as in conventional printed editions; only square 
brackets for plain erasures, [/ ] forslashes and [X ] for crosses. 

181 Leiden+ employs a | subst| tag working the same wasy as | reg| and | corr]. An interest- 
ingly complex case (P.Strasb. inv. 1187, A, i,11 = http://litpap.info/dclp/59968) is presented by BER- 
TONAZZI 2018a, 64 and 69-70: an ancient scribal correction, involving the insertion of a letter supra 
lineam, was read differently by two editors, so that they proposed two different interpretations, one 
of which involved a regularization. The newer reading is opetAtoT@v corrected in augu o//ráv i.e. 
cjuo àv; the previous reading was vw et Motwv corrected in vw Sei At‘w/twv. In Leiden+ this is 
encoded as <:<:<:0p1ALt@t@v|reglopstAt\w/tév:>|subst|opstAtotov:>|ed|v®, det 
«:AiNo/xov|subst|Aiotov:»-ed.pr.:», and the XML is cross-nested accordingly, so that the 
current display on the platform appears quite messed up. 
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The philological care testified by the cases of ‘scribal variants' mentioned above is 
even more patent when the text is an autograph'? and is equipped with authorial revi- 
sions, for which an important contribution can come from the XML annotation of ge- 
netic criticism phenomena recently developed by Elena Pierazzo.! Raffaella Cribiore 
has recently showed how genetic criticism — aimed at reconstructing the process of au- 
thorial constitution of a text — can be successfully applied to papyrological texts.’ 


4.10 Editorial interventions (modern) 


Modern alternative readings and editorial supplements do influence linguistic anno- 
tation, in that they add data, which are not stricto sensu ‘original’ to the text. Alterna- 
tives produce multiple possible readings, one of which is usually the most probable 
but without full certainty, and the other possibilities may well fit the context. Supple- 
ments, though most likely and in some cases pretty unavoidable, are nonetheless a 
modern contribution to the ancient fragmentary text and deserve a particular atten- 
tion. They can even be incorrect, and thus fall into the third category of editorial cor- 
rections, which encompass all modern corrections made to the readings of previous 
modern editors. Alternatives and editorial corrections are currently encoded as appa- 
ratus elements («app») defined by the type attribute and composed of a “lemma” 
(«lem»), i.e. the word in the text, and a “reading” («rdg»), i.e. the alternative in the 
apparatus. In Leiden- they are marked with the | alt | and the | ed| tag respectively. 
Such a markup strategy works fine from the ‘philological’ viewpoint, since it provides 
amain reading - supposedly the most correct — and a set of critical alternatives, either 
proposed by the same editor or sedimented through years of scholarship, with the 
possibility to indicate the authorial responsibility for each reading (resp attribute in 
the «1em» tag).'^ Nevertheless, the impossibility to search for combination of words 
including the terms in the apparatus makes this choice rather uneasy for the purposes 
of digital databanks, while different layers of text, each one featuring a single textual 


182 On medical autograph papyri cf. ANDORLINI 1997a, 22 with earlier bibliography; MARGANNE 2004, 90-1. 
183 Cf. http://www.tei-c.org/Activities/Council/Working/tcw19.html; PIERAZZO 2008. 

184 CRIBIORE 2018: see in particular the case of the medical Anonymus Londinensis and the related 
discussions of double versions. From the computational viewpoint, cf. MACE — BARET — BOZZI - CIGNONI 
2006 (in particular, PASSAROTTI 2006). Genetic criticism can be applied to some documentary catego- 
ries which show a certain complexity of textual composition. One may recall, just for instance, the 
legal documents of Ammon's archive, produced in multiple versions (P.Ammon II; cf. CRIBIORE 2018); 
Raffaele Luiselli's considerations about authorial revisions in Roman letters and petitions (LUISELLI 
2010); the mostly neglected cases of duplicates recently 'rediscovered' by Malcolm Choat and Rachel 
Yuen-Collingridge (YUEN-COLLINGRIDGE — CHOAT 2012); the composing process of administrative re- 
ports studied in the Project Synopsis at the Heidelberg University especially by Uri Yiftach (cf. REG- 
GIANI 2016c); the very recent discussion on drafts and copies by Andrea Jórdens (JÓRDENS 2017). 

185 See, however, DAMON 2016 for criticism of this way of handling apparatus readings by TEI. 
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alternative, may enhance the digital representation of the papyrus, especially when 
considering that editorial interventions occur more frequently in the medical papyri 
than in the documentary ones, for the peculiar attention to the editorial history that 
characterizes the items of the Digital Corpus of the Greek Medical Papyri.!8° 

On the other hand, supplements are tagged as such with the «supplied» ele- 
ment, and a reason attribute that defines the type of integration: " lost" ifthe orig- 
inal text is lost (Leiden+ square brackets), "omitted" if the original text was left 
over by the ancient scribe (Leiden+ angle brackets), "parallel" if the text is in- 
serted on the ground of a parallel text (Leiden+: pipes + underscores |__| ). The 
opposite case (removal of ancient surplus text) is marked with a «surplus» tag (Lei- 
den»: curly brackets). In this case, integration with the text layer is granted by the 
fact that the «supplied» tag indicates a text portion. This is even clearer if we com- 
pare it with the tag used to mark unsupplied lacunas, that isa «gap» tag with a rea- 
son attribute set to "Lost" (see above).!” 

The case of the gaps is indicative of the semantic difference between digital and 
paper edition. In a printed critical system, both supplied and unsupplied lacunas are 
marked with square brackets because the focus lies in the descriptive layer ofthe pap- 
yrological fact: a certain missing part of the text, which may be recoverable or not. In 
a digital critical context, we need to define whether a lacuna bears a textual meaning 
(i.e., a supplied text) or not (i.e., a gap in the text). Leiden+, following the printed 
conventions, adopts square brackets for both, in order to help the users; but the sys- 
tem automatically chooses the appropriate XML code according to the content of the 
brackets. Therefore, when the papyrus displays a partially supplied gap, which is en- 
closed by the very same pair of brackets in the printed edition, in the digital edition 
the two different parts (supplied and unsupplied) must be kept separated since they 
mean two different facts. Leiden- brackets are different than Leiden printed ones also 
in that the former must be always opened and closed at each gap, while in a printed 
edition they can be left open (or unclosed) if their exact extent is unknown. 

Somehow ambiguous, in conclusion, is the treatment of modern corrections in 
the case of misspelled words. Though the typical treatment involves the | reg| and 


186 Cf. CORAZZA 2018a; BERTONAZZI 2018a, 64-7 (with case studies) and 2018b. 

187 The theoretical assumption that the fragmentary status of the papyri may be thought as a (para- 
doxical) sort of ‘non-voluntary quotation”, selected by the chance and by the material circumstances 
rather than by the will of an author, would allow to envision a transtextual link between a ‘virtual’ 
hypertext (the original document, lost, more or less recoverable in a philological way) and the concrete 
hypotext (the actual fragment; cf. REGGIANI 2016a; see also ROMANELLO — BERTI — BOSCHETTI — BABEU — 
CRANE 2009, 160 and 162: “[...] fragments do not actually exist outside of scholars' interpretations. [...] 
Fragments are always scholarly reconstructions and interpretations of the content and structure of lost 
works”). This may allow for creating several multiple layers for editorial alternatives and supplements 
too, thus avoiding complicated nested tags as in the case of multiple alternatives in a series of different 
supplements or modern editorial readings (see the samples provided in the Conclusions below). 
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| corr| tags according to the type of intervention (see above), the use of traditional 

Leiden conventions (angle brackets for supplements of omitted letters, curly brackets 
for removal of surplus letters) is admitted in case of outright diplographies (“where 
the letter(s) is genuinely superfluous”, so say the guidelines)! or trivial omissions 
(for which it is preferred to the | corr| tag). 


4.11 Image 


When available, the addition of a digital picture is fundamental for a complete evalua- 
tion of the papyrus. The advanced possibilities of virtual objects, of which I discussed 
elsewhere, '? could be further enhanced by aligning text and image, a procedure that 
has been successfully attempted by the Anagnosis project at Wiirzburg.’”° 


5 Concluding remarks 


The so-called Michigan Medical Codex (P.Mich. inv. 21 = P.Mich. XVII 758, 
http://www.litpap.info/dclp/59332)"! resumes at the best most of the preceding argu- 
ments. Itis a IV-century small-format papyrus codex, of which thirteen leaves survive 
to an amount of twenty-six pages, in which numerous recipes are collected — seem- 
ingly — according to type of medication (pills and lozenges, then wet and dry plasters, 
at least in the extant pieces). Commissioned by a practicing physician,’” the docu- 
ment shows various degrees of textual interventions. In the original writing, recipes 
start with an indented heading, declaring the type of remedy, and are separated from 
each other with lines and small blank spaces; they typically contain the list of ingre- 
dients, followed by directions for composition and use. Many prescriptions are as- 
cribed to famous doctors, showing 


correspondences with recipes for plasters in the collections of Galen, Oribasius, Aétius, or Paul of 
Aegina that have come down in the manuscript traditions, highlighting the striking degree of con- 
tinuity among ingredients and their relative proportions from hand-written copy to hand-written 


188 For two cases in the medical papyri, see BERTONAZZI 2018a, 67-8 ({t@v otwapdv} in P.Strasb. inv. 
1187, A, i,14 and {o}oynpatioavtes in P.Lond.Lit. 166, iv,6). 

189 REGGIANI 2017, 137 ff. 

190 See above and the Anagnosis section of R. Ast’s and H. Essler’s chapter in this volume. 

191 YOUTIE 1996. For the following observations, I refer to HANSON 1996, HANSON 1997, 302-4, and 
ANDORLINI 2003, 26-8. 

192 Cf. YOUTIE 1996, 1-3; ANDORLINI 2003, 26-7. 
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copy over many centuries: the presence of plasters from a variety of different physicians suggests 
that the basic text of the codex was combining and taking its shape over considerable time.'” 


Then, the interventions by the owner of the codex: 


First he collated the text of his newly-made copy against an exemplar, making corrections in 
addition to the items already corrected by the scribe, and then he went on to more than double 
the contents of the codex by filling the margins with additional recipes for pills to medicate bod- 
ily ills and plasters to medicate wounds and lesions of every kind. Because empty space was 
limited, he emphasized separation between recipes through lines and marginal markers.!% 


Intertextuality, hypotextuality and similar connections merge together, creating a 
very complex and unique clockwork: “although individual recipes in a collection on 
papyrus often resemble items in the known authors, each extensive collection on pa- 
pyrus has thusfar proved to be a unique assemblage”.’” The paratextual function of 
critical and lectional marks stresses the “composite” structure of the text, "5 while au- 
thorial corrections and phonetic variants are not absent from the textual level. 

Let us compare a part of the printed edition with the corresponding digital edition 
currently featured in the DCLP,' followed by a tentative proposal of (partial) ontol- 
ogy network to describe the multiple textuality of the sample. 


ra 
¿ ra 
catre n e TED URS i 
3 x S Wap; p Q< $ n Ter Hs 
RESP" ua ARPA "fes icr 
OG sta ves DAN, EN 
PTET ~ Ty pa 
ex de papa i Ç i 
Yo omen Cun 
Nhe d CA 


JH 


Fig. 9: P.Mich. XVII 758 H r/v: main text with recipes taken from other authors; marginal annotations 
and additions with a reference system of coronides and other graphical marks (YOUTIE 1996, PI. 8). 


193 HANSON 2010, 197-8, and 1997, 303. 

194 HANSON 2010, 197-8, and 1997, 303. 

195 HANSON 2010, 199; cf. also the observations by BONATI 2016b, 60-9. 

196 Cf. ANDORLINI 2003, 26-7. 

197 The DCLP digital edition of the Michigan Medical Codex has been encoded by students of the 
Papyrology class (F. Bertonazzi, F. Corazza, L. Rizzardi, M.E. Galaverna, C. Bottioni, M. Catania, F. 
Giraldi, P. Lillo, G. Saccani, E. Mazzetti, L. Mazzolari, A. Brunazzi, E. Angolani, N. Pajares Collado, 
C.M. Ferrari) under the supervision of L. Iori, M. Centenari, I. Bonati, and N. Reggiani. 
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£vóca[c] Svetiq 
AÙTAV Li Sore 
ñ EunAactpoc dos [(vn +3 
xadaoac tà nétarla Ùcco-] 


nou wXAGSooO Tod 6[ 


—— 
(Hd.2) [É]unAactoc GvoSevlo)uévww cacc [ouEvy + 15] 


8 uot TH NOW 
éon (a) xnoo6 (oby.) a .[ 


4t utcuoc €c8( ) (60.) a a.[ 
xaABàv (nc) (o6vy.) B uo[ 
xaABávnv xai [ i 
[[...acacap]] sul 
6. n8ficac xed [ 


9. 5£56uxa, Pap. eow 10. Pap. ech 11. Pap. xarBay- 


Fig. 10: Printed edition of P.Mich. XVII 758 H r (YouriE 1996, 59). 


9 x 8 cm. 
Plate 8b 
]xai n[plòc toùc 
{x6Anove noocwa) tou [c) xat :G(v) 
[€AxZv cuvovuAo]uuévov xai E (neév) 
[npòc tà Suc]enobiwra ÉAxn nino (of) 


[nal noóc 6]& tac xtuétAac xš 


(&urAacrooc À Stà mwo]ueóXAuyoc noueóXoyoc (oby.) B 
[AL sanybpouv (oby.) Y ctéaroc dp)vudet [ou] (oby.)B xnooO (06y.) 6 


[xvAod Acvocnéouou (oby.). xauarunAiov] (oby.) € tà tnxtà 


[xarà Hv Aeiwv tñc ALSapyboov xJat tot nougóAuyoc 


3, marg. Pap. El 4, marg. Pap. nàng. 5. xewetAac: 


Fig. 11: Printed edition of P.Mich. XVII 758 H v (YouTIE 1996, 61). 
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folio H 


A É EN EE 
évedoats](*) Bueilg JC) [-ca.?-] 
avri [..] Sore [ -ca.?- ] 


ñ EuTrAaotpos poblivn - ca.3 -] 
xaBàpas(*) rà réraA[a úcocs-] 
trou xAdSou rou 5[ -ca.?- ] 


(hand 2) [£]urAacTos(*) &àgoBcv[o]uévoo caco[outvo - ca.14- J 
(hand 2) (added at left: ((coronis)) ) uoi(*) T® Tróvco 
(hand 2) SéSok(a)(*) knpou (ovy ) a. [ -ca.?- ] 
1@hand 2) (added at left: xp(norós)) ((cross)) (°) uíovos EoB(eoa(?)) (paxu ) a a [ -ca.?- ] 
(hand 2) xaABáv(ns) (ovryy ) B uv[ -ca.?- JC) 
(hand 2) xaAB&vny kai [ -ca.?- ] 
(hand 2) [. _ acacap] 8u[ -ca.?- JŒ) 
(hand 2) ëšin8ñooç xpo [ -ca.?- ] 


[-ca.?-] kai m[p]os rous 
[kéAtrous mpooqó]rou[s] kai <étri(?)> t@(v) 
[EAxdv ouvovAo]uuévoov(*) kai 
[mpòs tà Suo]emovAwta EAKN 
5 [kai mpds 8]š Tas xiuévAas(*) Ke(*) 
3,md(hand 2) £(npóv)(*) 
4,md(hand 2) mAnp(oi) 


(hand 2) [ÉurrAac pos ñ Sià mo]upóNuyos moupéAuyos (ovyk ) B 


(hand 2) [xuAoù Aivoomépuou (ovryx ) 1 xauapyrAou] (oùyx ) e rà rnkrà 
(hand 2) [ka rà T@v Acicov Tris Ai9apyvpou klai TOU rouipóAvyos 


^ H.r.1-2. or [ouv] |£évo;ca [s] 
^ H.r.2. or svep[y s] 

^ H.r.4-5. or [ -ca.?- ava] |ka8ápas, or [ -ca.?- 51a] |ka&ápas 

^ H.r.7. (hand 2) [c]umAaoroo corr. cx [c]ulAjaovoo, |. £yrÀAaotpos 
^ H.r.8. or [£] | pot 

^ H.r.9. |. 5éBcok(a) 

^ H.r.10. or A(áuBave(?)), or AláBe(?)) 

^ H.r.11. or uv[píkns -ca.?- ], or pu[poivns -ca.?- ] 

^ H.r.13. or 8u[eíq -ca.?- ] 

^ H.v.3. or [érrovAo]uu£vcov 

^ H.v.3,md. or E(npiov) 

^ H.v.5. corr. cx xipé6Aas 

^ H.v.S. L koi 


Fig. 12: DCLP digital edition of P.Mich. XVII 758 H r/v (http://www.litpap.info/dclp/59332). 
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adv [. 


011xoyodAy si 


is hypotext tb 


curly paragraphos 
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01 1xaj esed si 


Ced 


Fig. 13: Tentative ontology model for P.Mich. XVII 758 H r. Some layers are simplified; note that in 
the metatext layer (below) the hypotext and the hypertext are merged (and some editorial supple- 
ments and alternatives are missing) in order to give space to the annotation of abbreviations and 
symbols, which clearly shows their intensive deployment by the second hand (= the owner of the 
codex). Noteworthy is [£|urAaotoc (corrected from [£]u[[AT, l. 7), which is a substandard spelling 
variant of the more common £ymAaotposc: as already noted by YouriE 1996, ad loc., “ÉumAaotog, ac- 
cording to Galen (XIII 372 [K.]), was an earlier form of £urrAaotpoc". Moreover, the entire metatex- 
tual paragraph must be noted as written in ekthesis in the bottom margin, which is marked line by 
line in the Leiden+ code. Here, it would suffice to encode it as a marginal metatext and to connect it 
to an ekthesis paratext layer (see the following sample). 
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] Kal z[p]óc tobc 


[KóAxovc zpoaqodá]rov[c] Kai (£xi(?)) xv 
[£xéxKguvovAo]upévo? xal. qusdilitiaMiitlas, 


[xpdc tà Svo]eroviota £n 
[xc xpóc 8]à tac xuétdac xè damno [crt 
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—> 
Go 
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metgtext to 


Paul.Aeg. VII 17,31 


is intertext to 


ekthesis 


Fig. 14: Tentative ontology model for P.Mich. XVII 758 H v. Some layers are simplified; in the 
metatext layer (below) the hypotext and the hypertext are merged as in the preceding sample, but 
here symbols are not handled in order to give space to the ekthesis paratext layer and to the 
intertextual layer, since the recipe added to the bottom margin (ll. 6—9) closely recalls (in the 
typology and number of the ingredients and in their quantities) a passage of Paul of Aegina (VII 
17,31). Note also how the right-hand-margin additions are handled as metatext layer connected to 
Il. 3-4 ofthe main text, whereas the Leiden markup does not handle the situation properly 
(marginal lines can be added within the text, or at the end, but in both cases some metatextual 
information gets lost). Quite interestingly, the scribal phonetic correction yiuétAag for xui£[[8]Aoc (I. 
5) attests to a preference for a form used by Paul (e.g. IIl 79,1) rather than other medical writers (e.g. 
Orib. Coll. IV 615,19; 620; Syn. VII 45; Gal. XIII 380,5 K.; 383,17 K.). 


Admittedly, printed or printed-like media are physically limited as to dealing with 
complex degrees of textuality, and adopted the critical edition model as a way of fix- 
ing a text for scholarly purposes. On the contrary, ancient textual criticism — recog- 
nizable in the commentaries, the annotations, the philological interventions, the par- 
atextual care deployed by the ancient scribes and scholars — was apparently a way to 
pass down knowledge, i.e. a means of text transmission rather than text reconstruc- 
tion and fixation. Nowadays, thanks to the digital tools, we do have the occasion to 
develop digital infrastructures in a hyper-dimensional cyberspace to overcome tradi- 
tional criticism and its shortcomings, and to conceive a digital critical edition with 
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deeper and deeper levels of text analysis (markup tagging, linguistic or semantic an- 
notation layers, in-text information).!8 

As BODARD — GARCÉS 2009 argue, a major advantage of digital editions (namely the 
papyrological ones) is the possibility to get back to the materiality of texts, avoiding 
the philological necessity of reconstructing an archetype and focusing on text trans- 
mission instead. “[A]ttention would be better focused on how to present a text with 
multiple manuscript witnesses to a reader in a digital environment”: 


Digital editions may stimulate our critical engagement with such crucial textual debate. They 
may push the classic definition of the ‘edition’ by not only offering a presentational publication 
layer but also by allowing access to the underlying encoding of the repository or database be- 
neath. Indeed, an editor need not make any authoritative decisions that supersede all alternative 
readings if all possibilities can be unambiguously reconstructed from the base manuscript data, 
although most would in practice probably want to privilege their favoured readings in some way. 
The critical edition, with sources fully incorporated, would potentially provide an interactive 
resource that assists the user in creating virtual research environments.”” 


Thus, the authors hople] that digital or virtual research environments would support the creation 
of ‘ideal’ digital editions where the editor does not have to decide on a ‘best text’ since all edito- 
rial decisions could be linked to their base data (e.g., manuscript images, diplomatic transcrip- 
tions).?9! 


Similarly, NICHOLS 2009 states that the ideal ofthe archetype text and textual criticism 
is an “artefact of analogue scholarship” consequent to the limitations of the printed 
pages. Conversely, 


[t]he Internet has altered the equation by making possible the study of literary works in their 
original configurations. We can now understand that manuscripts designed and produced by 
scribes and artists — often long after the death of the original poet — have a life of their own. It 
was not that scribes were ‘incapable’ of copying texts word-for-word, but rather that this was not 
what their culture demanded of them. [...]. [I]t requires rethinking concepts as fundamental as 
authorship, for example. Confronted with over 150 versions of the work, no two quite alike, what 
becomes of the concept of authorial control? And how can one assert with certainty which of the 
150 or so versions is the ‘correct’ one, or even whether such a concept even makes sense in a pre- 
print culture.” 


Thus, the digitization of manuscripts and the creation of digital critical editions have not only 
provided new opportunities for textual criticism but also might even be viewed as enabling a 


198 L. Berkes, in his chapter for this volume, asks: “should we expect online editions to conform to 
the norms of traditional printed editions or should we accept them as a slightly different form of 
publication?” 

199 BABEU 2011, 36. 

200 BODARD - GARCÉS 2009, 96. 

201 BABEU 2011, 36. 

202 NICHOLS 2009. 
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type of criticism that better respects the traditions of the texts or objects of analysis them- 


selves?9, 


Consider also the reflections of CAYLESS 2010 about the prominence of the transmis- 
sion of content on its external appearance: 


[plagination is a relatively fragile construct in the digital age”, and textual “accretions” like com- 
mentaries, glosses and marginal notes, progressively gathered around the main text in its his- 
torical transmission, can be effectively encoded and represented in digital editions that not 
simply replicate print technologies.*™ 


When we note (again after CAYLESS 2010, 162) that — functionally and theoretically — 
traditional commentary is a hypertext in print,” everything comes full circle, and it 
appears clearly how the new technologies can produce a very similar outcome as the 
ancient textual criticism described above. It can be argued, therefore, that a digital 
critical edition can develop into something completely different from the somehow 
‘old-fashioned’ printed critical edition: namely, a further step in the fluid textual 
transmission of ancient sources. 
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Rodney Ast, Holger Essler 
Anagnosis, Herculaneum, and the Digital 
Corpus of Literary Papyri 


1 Overview 


The Digital Corpus of Literary Papyri (DCLP) initiative was launched in July 2013 with 
support from the NEH/DFG Bilateral Digital Programme and upon successful comple- 
tion of a one-year planning grant under the same program. The project, which ended 
in August 2017, built the necessary framework for a large-scale corpus of literary pa- 
pyri on the basis of infrastructure already in place at www.papyri.info for documen- 
tary papyri. And by “literary papyri” we mean both canonical literary genres, such as 
epic, lyric, drama, oratory, etc., and so-called para- or subliterary texts, whether mag- 
ical, medical, or school. All content has been encoded in the well-established XML- 
TEI format known as EpiDoc. An attempt has been made to customize search, browse, 
and editing functionality to the needs of individuals who deal with literary texts. At 
the same time we have wanted to encourage engagement with all extant papyrologi- 
cal sources, both literary and documentary. As a result, users can search across the 
entire corpus of texts in the Duke Databank of Documentary Papyri (DDbDP) and 
DCLP. 

Despite being headquartered in New York and Heidelberg, the DCLP has profited 
from collaboration with a large number of researchers and institutions across Europe 
and the United States. Mark Depauw and Willy Clarysse in Leuven shared metadata 
belonging to the Leuven Database of Ancient Books (LDAB) for nearly 15,000 objects. 
This information constitutes the backbone of each DCLP record (Fig. 1 with metadata 
from LDAB). 


This articles stems from two separate talks delivered by the individual authors at the conference 
“Greek Medical Papyri. Text, Context, Hypertext” (Parma, 2-4 November 2016). Ast is responsible for 
the part on the DCLP, while Essler is behind the sections on Herculaneum and Anagnosis. Both au- 
thors read and commented on the complete article. 


1 Principal investigators on the project were Rodney Ast (DFG) at the University of Heidelberg and 
Roger Bagnall (NEH) at New York University. Both Ast and Holger Essler have presented the project 
on numerous occasions, and a description of the initiative can be found also in REGGIANI 2017, 250-4. 


8 Open Access. © 2018 Rodney Ast, Holger Essler, published by De Gruyter. | COEXZTZTM This work is licensed 
under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. 
https://doi.org/10.1515/9783110547450-003 
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Browse: DObDP HGY APIS DCLP Authors 


DCLP/LDAB Data [xml] 


Availability 
Origin 


Fig. 1: LDAB metadata. 


Similarly, cooperation with Duke University's Duke Collaboratory for Classics Com- 
puting (DC3) allowed the DCLP team to build on the standards and tools in place at 
Papyri.info. Holger Essler and his team in Würzburg, together with Gianluca Del Mas- 
tro in Naples and Daniel Riaño in Madrid, were responsible for the addition of hun- 
dreds of files containing bibliography, links to sketches, engravings and photo- 
graphs, and transcriptions of Herculaneum papyri. The Parma Medical Project, under 
the leadership of Isabella Andorlinit and Nicola Reggiani, produced extended edi- 
tions of over 200 medical papyri. Furthermore the project benefited from the partici- 
pation of numerous students and scholars who have both entered transcriptions and 
vetted submissions. 

Before detailing what an individual DCLP record might contain and what the pro- 
ject seeks to cover over the long term, we will first speak more generally about the 
nature and scope of the initiative. The DCLP is not the first large-scale online resource 
for classicalliterature. The Thesaurus Linguae Graecae and the Perseus Digital Library 
both offer searchable and browsable transcriptions of ancient texts.? The most obvi- 
ous difference with these initiatives is the fact that DCLP covers only papyrological 
evidence, including papyri, pre-medieval parchment, ostraka, tablets, dipinti, and 
other non-inscriptional evidence. It does not incorporate medieval manuscripts, and 
its interest is not strictly in the ‘text’ per se, but rather in the text as it appears on any 
given material substrate. In this respect, it focuses on the inscribed object as a whole, 
and tries to account for extra-textual elements such as layout and non-verbal signs 


2 The former, which is largely confined to Greek literature, is hosted at http://stephanus.tlg.uci.edu; 
the latter, which comprises both Greek and Latin, can be found at http://www.perseus.tufts.edu/hop- 
per. Colleagues at Tufts University and the University of Leipzig are working in a much more ambi- 
tious project called the Open Greek and Latin Project (OGL) to expand Perseus in order to include all 
Greek and Latin texts, http://www.dh.uni-leipzig.de/wo/projects/open-greek-and-latin-project. 
DCLP has the potential both to enhance OGL and benefit from it. 
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(e.g., paragraphoi, diplai, etc.), in addition to the written words.’ As a result, a higher 
premium is placed on accurate decipherment of all elements of the textual witness. 
This is not to say that a photographic representation of these elements is offered in 
the HTML text, but links to photographs are provided when possible, so that the user 
can observe all features of the inscribed object. 

We have also tried to make it easy to discover content in the DCLP. Text-search 
functionality is the same as in Papyri.info, as are many of the browsing options, and 
we have retained the faceted browsing capability. In addition to finding texts by their 
publication numbers, provided the publications are known to the Checklist of Editions 
(http://www.papyri.info/docs/checklist), one can also locate them by TM numbers 
(Fig. 2 with three browsing options). 


Browse: DDBDP MGV APIS DOP TM Number orSearch: Data Bibliography 


a browse by series 


pyrological Navigator (PN) for searching and browsing and Papyrological Editor (PE) for peer-reviewed curation c 
1 papyri.info does for Gr An ultimate is also ta ide se 


baume by TM member 


gs (LAB), Bibliographie Papy que (t 
he University of Würzburg and the Duk 


More information 
igital Papyrology blog 


Fig. 2: DCLP browsing options. 


The latter is the most effective means of finding a known object, since every item has 
a TM number, but prior knowledge is a prerequisite, and in that respect it hardly can 
be described as a true browsing function. The author/works browsing option, on the 
other hand, represents an entry point that should be welcomed by the literature- 
minded user who wants to see how many papyri by a known author survive (Fig. 3). 
In addition to giving the names of extant works by known authors, the system also 
says when there is a Greek text available. Here again, though, DCLP is not charitable 
to ignorance: works by unknown authors do not appear in the list. 


3 The project is very much in tune with current efforts in classical studies to account for physical 
aspects of ancient witnesses. One example of these efforts is the University of Heidelberg’s Sonder- 
forschungsbereich 933, Materiale Textkulturen; more information about this SFB can be found at 
http://www.materiale-textkulturen.de. 
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Euripides 
e Aigeus 
* Euripides, Aigeus 59946 (na;;59946) (Greek text available) view in delpxsltbox | github 
* Alcestis 
* Euripides, Alcestis 68623 (p.0xy;67;4546) (Greek text available) view in dclpxsltbox | github 
* Euripides, Alcestis 68624 (p.0xy;67;4547) (Greek text available) view in delpxsltbox | github 
* Alemaeon 
* Euripides, Alemaeon 59812 (psi;13;1302) view in dclpxsltbox | github 
* Alexandros 
e Euripides, Alexandros 115700 (na;;115700) view in dclpxsltbox | github 
* Andromache 
e Euripides, Andromache 59873 (p.0xy;67:4551) (Greek text available) view in dclpxsltbox | github 
Euripides, Andromache 59824 (p.0xy;31;2543) view in dclpxsltbox | github 
Euripides, Andromache 59857 (p.harris;1;39) view in delpxsltbox | github 
Euripides, Andromache 59878 (na;;59878) view in dclpxsltbox | github 
Euripides, Andromache 59833 (p.0xy;22;2335) view in dclpxsltbox | github 
Euripides, Andromache 59897 (na;;59897) view in dclpxsltbox | github 
Euripides, Andromache 59931 (p.hib;1;25) view in dclpxsltbox | github 
Euripides, Andromache 59899 (p.ross.georg;1;8) view in dclpxsltbox | github 
Euripides, Andromache 59858 (p.0xy;3;449) view in dclpxsltbox | github 


Euripides, Andromache 59876 (na;;59876) view in dclpxsltbox | github 

Euripides, Andromache 68625 (p.0xy;67;4552) (Greek text available) view in dclpxsltbox | github 
Euripides, Andromache 68627 (p.0xy;67;4554) (Greek text available) view in delpxsItbox | github 
Euripides, Andromache 68626 (p.oxy;67;4553) (Greek text available) view in dclpxsltbox | github 


Euripides, Andromache 68628 (p.0xy;67;4555) (Greek text available) view in delpxsltbox | github 


Fig. 3: DCLP authors and works. 


DCLP is not only a portal to information about published papyri, it is also a curatorial 
platform. Using the same editor employed at Papyri.info, the system allows users to 
propose emendations to published texts and to contribute introductions, commen- 
taries, translations, as well as updated, machine-readable bibliography. All new con- 
tentis vetted by members of the editorial board. The DCLP instance of Berlin P. 12310, 
an ostrakon preserving five verses of Theognis (vv. 434-8) plus an unidentified com- 
edy fragment, is an example of an emended text.^ The Theognis verses were thought 
by the original editor to be inferior to those preserved in other witnesses, one of which 
is Plato's Meno (95e). Even though the first lines are transposed and thus differ from 
the Theognidean manuscript tradition, the text is not as banal as the original editor 
thought. The ordering of the lines on the ostrakon agrees with the text found in Plato 
and the unique and incomprehensible reading of mdAw that is printed by the first ed- 


4 See http://litpap.info/dclp/62823, which contains extensive bibliography. 
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itor of the ostrakon in line 3 has turned out upon closer examination by Julia Lou- 
govaya to be a mistaken reading.’ The sherd actually has xoAAoóc, which is attested 
in all other witnesses. Lougovaya has emended the online text accordingly (Fig. 4, 
http://litpap.info/dclp/62823). 


Apparatus 


Fig. 4: DCLP text, apparatus, and notes. 


The Parma medical project has been a significant contributor of new content, includ- 
ing extended editions of many papyri. P.Yale II 134 is representative of this work.° The 
introduction to the online edition, which is authored by N. Reggiani, briefly describes 
the content of the text, which is a set of iatromagical prescriptions. This is followed 
by the edition, critical apparatus, and commentary. The header gives bibliographical 
references and information about the date, provenance, physical dimensions of the 
fragments, etc. It also contains a link to an image of the papyrus at the host institu- 
tion. While mostly based on previous printed editions, the DCLP record is itself a 
stand-alone edition. 


5 The ostrakon was first published by VIERECK 1925, 254-5, and subsequently in PORDOMINGO 2013, no. 
27. Wolfgang Luppe arrived independently at the same conclusion about Viereck's reading of raw 
in his re-edition of the ostrakon in CPF II, 2, pp. 358-9. 

6 See http://www.litpap.info/dclp/64529. 
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2 Herculaneum 


The texts of the Herculaneum papyri were also encoded on the basis of a reference 
edition, but provide additional bibliography and links to images. Our starting point 
was the Thesaurus Herculanensium Voluminum, the first full-text database of the Her- 
culaneum papyri. It currently contains 26 texts with some 20k lines that were entered 
jointly by the Centro Internazionale per lo Studio dei Papiri Ercolanesi (CISPE) and 
the Wiirzburg Institute of Classics.” Since we firmly believe that uniting the Hercula- 
neum papyri with the literary papyri on a single platform is an important step towards 
the unity of our discipline and, in particular, towards the development of Hercula- 
neum Papyrology, all texts were transferred to DCLP and further work was and will 
be based on this database. Currently DCLP comprises 117 editions of Herculaneum 
papyri and can be considered fairly complete for this group. 

Since new editions of Herculaneum papyri tend to take decades, it was necessary 
to refrain from reediting or rechecking readings on the original papyri. In order to 
make the texts available as soon as possible we had to compromise even further: text 
entry was strictly based on the most recent comprehensive editions. Especially where 
readings are highly disputed, it would have been impossible to decide without thor- 
ough study of the whole papyrus. Instead, we were striving to provide a complete 
bibliography of editions of Herculaneum papyri. Currently this first-ever complete in- 
dex comprises more than 1,400 records, of which 387 are reference editions, the cen- 
tral focus and basis of the digital text. In general, there are four categories (see the 
metadata in Fig. 5). 

a) Reference editions: the bases followed for text entry 

b) Previous editions: any edition predating the reference edition, which may be con- 
sulted in the future for the apparatus and concordances 

c) Partial editions published after the publication of the reference edition 

d) Readings: individual readings without establishing a syntactic connection, pub- 
lished after the reference edition 


Although many partial editions provide substantial improvements to the text, their 
incorporation was not feasible at this stage. Besides, the current software does not 
allow to reference parts of text to particular editions. 

The prerequisite for a uniform data structure was the establishment of the rela- 
tionship between the canonical numbers of Trismegistos (http://www.trismegis- 
tos.org) and the LDAB and the traditional inventory numbers of the Herculaneum pa- 
pyri (there referred to as Gigante Numbers, following the Catalogue edited by him).? 


7 (12November 2017). 
8 Alist of Herculaneum texts entered is available at: http://epikur-wuerzburg.de/thv. 
9 GIGANTE 1979. 
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While Trismegistos and LDAB assign an identification number to each ancient book 
or scroll, the inventory numbers of the Herculaneum papyri refer to the fragments as 
they were inventoried in the 18° century. Several scrolls were already broken in pieces 
at the time of the excavation, some were cut to facilitate unrolling, and every piece, 
be it a stack of several layers from the outer part of the scroll or a part from the central 
cylinder, was assigned a different inventory number. Over the years, these pieces had 
a story of their own, and might now be in very different condition. In addition, diffi- 
culty in discerning the text on the darkened surface also resulted in miscataloguing 
single fragments. Thus there are several cases where a single inventory number con- 
tains fragments belonging to different papyri while often a single scroll (correspond- 
ing to one TM number) has to be reconstructed from several inventory numbers.” In 
several cases fragments that have been edited separately are now proven to belong to 
the same scroll. An example is TM 62499, containing Philodemus De rhetorica II (cf. 
Fig. 5). The scroll can be reconstructed from five inventory numbers, but no compre- 
hensive edition is available. We have thus limited ourselves to assembling the text 
from the four reference editions that cover a maximum of the text preserved. Since 
they were published by different editors at different times (from 1892 to 1977), it is not 
surprising they differ considerably in scope and method. 


Fig. 5: Metadata of TM 62499, Philodemus On rhetoric 2. 


10 Examples are TM 62400, Philodemus De pietate, and TM 62419, Philodemus De poematis 1 (cf. e.g. 
the scheme in OBBINK 1996, 43; JANKO 2003, 105). 
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As a consequence of their preservation and the combination of fragments, the surviv- 
ing remains of a Herculaneum papyrus are on average considerably larger than those 
of the Egyptian fragments. They normally exceed 600 lines and arrive at a maximum 
of nearly 4,500 lines for Philodemus De musica IV, distributed among more than one 
hundred fragments. Since for many fragments several images are available online, 
the traditional separation of text and metadata as respected in the display of Pa- 
pyri.info resulted in extremely long lists of information without any visible connection 
to the corresponding texts. To overcome this difficulty we introduced a system of link- 
ing each fragment or column with specific metadata by virtue of a corresp attribute 
(Fig. 6). This allowed us to include links to the more than 6,000 images of Hercula- 
neum papyri or their apographs that are available on the internet. 


to="2015" 
id="GR6672.01"> 
1e"http: / he. sub.museun/berlpap/Origina)/P_06845_R.jpg"/> 


«text? 
<body> 
«head wml: lang-"en"/> 
«div xml:lange"grc" typee"edition" xml:spacee"preserve^» 
«div ne*1*|cor 8986672] subtypee"frageent" typee"textpart^»cab» 
«1b ne"433"/>csupplied evidencee"parallel" reasone"lost"»tioww 6° Spo. pè</supplied>cunclear>ve/unclear> ADoav «oAA«unclearsttptyacc/unc14 


«1b n«*435*/»«supplied evidence="parallel" reason="lost">Appara 6° ExAww«/supplied»«unclearog«/unclear»v mpbq tvümvto mapéavóuvta- 
«lb n="436"/><supplied evidencee"parallel" reasone*lost"»aütat 6€ xpue</supplied><uncleersé</unclearsovow ènt xAvoyotox xádvCov- 
«1b n«7437"/»«supplied evidencee"parallel" reasone"lost"»uiyÓ BAdo</supplied>cunclearsie</unclear>: Ocoto, Giov reringtvar ñtoo- 


Fig. 6: XML of TM 60411. 


By virtue of the corresp attribute the data about an image is then displayed together 
with the text of the fragment it depicts (Fig. 7). This system is followed throughout in 
order to link a fragment with specific image files, whereas Images in the header refers 
to an internet resource (mostly the site of a papyrus collection) where images are 
available. In the screenshot, one can see under the title the inventory number of the 
papyrus and its corresponding fragment, as well as information and links to images 
of various types. Furthermore, in fragment 4 there is additional markup at the begin- 
ning of the line: the lower square brackets mark text supplied on the basis of a parallel 
tradition, which has been drawn on in order to supplement parts lost in the original. 
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DCLP Transcription [xml] 


1 
oiumn š 


P.Herc. 1672 


ketched 1781-1798 by 


Fig. 7: display of TM 62499, Philodemus On rhetoric 2. 


Digitization of the Herculaneum papyri was greatly facilitated by collaboration with 
Daniel Riaño, who is using his software AristarchusX for grammatical analysis and 
author recognition in the Herculaneum papyri." He transcoded all the texts previ- 
ously available at the Thesaurus Herculanensium Voluminum and many others. His 
work yielded further improvements, such as automatic lemmatization and spell- 
checking of the text. At a first stage we had included lemmata and numbered single 
words in order to allow linking to his software. However, this further level of infor- 
mation created problems both to the human reader of the XML and to the transfor- 
mation stylesheets of the Leiden+ converter. We thus decided to suppress it for the 
time being while exploring solutions with stand-off markup. 


3 Anagnosis 


The Anagnosis project may be considered the first project user of DCLP as it natively 
builds upon the texts encoded there while it also aims at expanding and enlarging 
coverage of ancient authors. Being part of the Kallimachos project (http://kal- 
limachos.de), whose objective it is to create a regional centre for digital edition and 
quantitative analysis in Würzburg, Anagnosis is working on image analysis. The aim 
is to link illustrations and transcriptions of papyri or, more precisely and technically, 
tolink the edited texts of Greek literary papyri from the full text database of the Digital 
Corpus of Literary Papyri with the digital images of the originals provided by the indi- 
vidual collections. 


11 See RIANO RUFILANCHAS 2014. 
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By default, the link under Images in the metadata taken from LDAB leads to the 
page of the respective repository or collection that provides the images. In total, there 
are some 12,000 digital illustrations available covering approximately 10,000 Greek 
literary papyri (TM numbers). However, the distribution is uneven: while no images 
of many literary papyri are available, there can be hundreds of a single Herculaneum 
papyrus, reproducing different fragments and columns, which were taken at different 
times and with different methods. Thus 152 Herculaneum papyri are covered by 6,000 
illustrations, and more than 3,000 of them are engravings from the 18° and 19° cen- 
turies. It was partly for the needs of and thanks to the data provided by Anagnosis that 
the new display of links to individual images above single fragments was introduced. 
This makes it possible to automatically create a parallel display of image and text 
(Fig. 8). 
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Fig. 8: TM 60411: Homer, /liad VIII 433-47. Image O BerlPap. Berliner Papyrusdatenbank. 


The output is created entirely from separate digital sources and is loaded ad hoc for 
further processing. The image is retrieved from the server of BerlPap, a project of the 
Berlin Papyrus Collection, whereas the transcript comes from DCLP. Thus Anagnosis 
links the two resources by virtue of the TM number and the fragment number, if ap- 
plicable. As explained above, these links are stored in the corresp-attribute in DCLP. 
Some might find already this parallel display useful, but in the perspective of An- 
agnosis this represents just the starting point ofimage analysis, with the ultimate goal 
being to link each letter of the transcript to the corresponding zone in the image. This 
will allow us to extract and display a complete alphabet of the letters present in the 
papyrus, a tool traditionally used for deciding on the reading of damaged areas. Such 
a complete set of images for each letter will further enable automated examination of 
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the uniformity of shapes and of the scribe’s consistency. In addition, there is the pos- 
sibility of automated graphic reconstruction to evaluate the spacing of proposed sup- 
plements. 

The development of the underlying software was carried out at the German Re- 
search Center for Artificial Intelligence by Saqib Bukhari and will be published in spe- 
cialized journals related to image analysis.” The software will be freely accessible for 
use and available for download. Anagnosis will only produce snippets of single letters 
and coordinates in a stand-off TEI format. Thus by working with the software the user 
is not requested to give away images of original papyri, but he may contribute by en- 
larging the pool of letter snippets that may be used for further research. 

Thus the aim of Anagnosis is not classical OCR on illustrations of manuscripts, 
but exactly the opposite: starting from a line-exact copy, as it is present in the Digital 
Corpus of Literary Papyri, the corresponding letter zones within the illustration are 
referenced to the known letters of the copy. The main reason for this is, of course, the 
many technical difficulties that stand in the way of automated text recognition of 
handwriting and for which there is still no satisfactory solution. We therefore use the 
existing material to go a simpler way first. At the same time, Anagnosis works with 
the same components that are essential for character recognition: the original image, 
the mapping vectors and the transcript. The only difference is in the direction of the 
assignment. Anagnosis thus creates a large annotated corpus of correct OCR results 
from papyri, even though these results came about through detours. This corpus may 
then be used in future projects as training data for machine learning. 
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Lajos Berkes 
Perspectives and Challenges in Editing 
Documentary Papyri Online 


A Report oon Born-Digital Editions through Papyri.info 


1 Text editions at Papyri.info 


This contribution reports on the digital publication of Greek documentary papyri 
through the platform Papyri.info: 


Papyri.info has two primary components. The Papyrological Navigator (PN) supports searching, 
browsing, and aggregation of ancient papyrological documents and related materials; the Pap- 
yrological Editor (PE) enables multi-author, version controlled, peer reviewed scholarly curation 
of papyrological texts, translations, commentary, scholarly metadata, institutional catalog rec- 
ords, bibliography, and images. 


The PE allows registered users to contribute content to the database. The submissions 
are then vetted by members of the editorial board. These contributions vary from cor- 
recting typos or entering translations to proposing new readings in already published 
texts. After this feature was introduced, the submission of corrections led quickly to 
the realization that in some cases extensive corrections may well justify reeditions of 
texts. This was a further impetus to experiment with accommodating born-digital edi- 
tions of documentary papyri at Papyri.info.’ 

Six papyri have been published so far in Papyri.info and some more will appear 
soon. The first edition dates back to 2013, when Nikos Litinas transcribed the back of 
P.Corn. 34 via Papyri.info. Subsequently and after much consideration, the editorial 
board of Papyri.info decided to present his readings not as corrections, but as a proper 
online edition.’ In 2014, the idea of exploring this issue in the framework of a seminar 
at the University of Heidelberg arose. In a class on digital papyrology offered in the 
summer semester 2014 by Rodney Ast, James M.S. Cowey, and myself several un- 
published papyri were discussed and prepared for a digital edition. These were 
Gothenburg papyri that were only described in H. Frisk's 1929 publication (P.Got.). 
The editions were peer-reviewed by at least two specialists in the field and appeared 
exclusively in Papyri.info. The three texts were fragmentary letters from the VI-VII c. 


1 http://papyri.info. For a detailed discussion see REGGIANI 2017, 222 ff. 
2 Born-digital editions were already included in the original concept of Papyri.info. 
3 DDbDP 2013 1: “Account of Barley and Wheat": http://papyri.info/ddbdp/ddbdp;2013;1. 


8 Open Access. © 2018 Lajos Berkes, published by De Gruyter. [C EXT] This work is licensed under the 
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. 
https://doi.org/10.1515/9783110547450-004 
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AD.‘ The publication of these documents was announced in the Bulletin of Online 
Emendations to Papyri 5.1 (January 7, 2016)? More recently, two ostraka were repub- 
lished in Papyri.info. The descriptum O.Did. 37 was emended by Hélène Cuvigny to 
an extent that justified a new edition, although no introduction or line-by-line com- 
mentary was added.’ O.Berenike II 237 was republished with introduction and com- 
mentary by Roger Bagnall and Rodney Ast.’ 

Work on these editions was a challenging task and not only for the usual schol- 
arly reasons. Although using databases has very much become an indispensable tool 
for papyrologists, producing an edition which appears exclusively online but con- 
forms to the well-established form of papyrological editions turned out to be quite 
difficult. In what follows, I will outline the main problems we faced during this pro- 
cess and present the preliminary solutions that we have been able to come up with. 
The form of online editions of documentary papyri is far from final: which conven- 
tions and solutions will be adopted depends very much on the reactions of the com- 
munity. In this spirit, this contributions aims also at encouraging further discussion 
of online editions in the papyrological community. 


2 An example: DDbDP 2015 1 


Let us have a look at one of the editions, DDbDP 2015 f? (= P.Got. 54 descr.). The layout 
of this edition is essentially the same as in the case of already published texts in 
DDbDP. The only major difference is the presence of an extensive introduction and 
commentary.” The edition begins with a summary of the HGV, TM and APIS 
metadata.” Next follows the introduction, the image and the translation in a 
convenient layout.” The edition ends with the text and apparatus and notes to 
individual lines (see Figg. 1-2-3). 


4 DDbDP 2015 1: R. AST —L. BERKES, “A Letter from Athanasios the Scholastikos Mentioning Constan- 
tinople” (http://papyri.info/ddbdp/ddbdp;2015;1); DDbDP 2015 2: A BERNINI, “A Requisition of Work- 
men and Donkeys” (http://papyri.info/ddbdp/ddbdp;2015;2); DDbDP 2015 3: L. BERKES, “A Wife in 
Prison: A Letter from 7th Century Fayum” (http://papyri.info/ddbdp/ddbdp;2015;3). 

5 AST- BERKES — COWEY 2016. Cf. REGGIANI 2017, 241. 

6 These were announced (together with DDbDP 2013 1) in AST — BERKES — COWEY 2017. 

7 DDbDP 2016 1: http://papyri.info/ddbdp/ddbdp;2016;1. 

8 DDbDP 2016 2: http://papyri.info/ddbdp/ddbdp;2016;2. 

9 For discussion of the criteria behind this referencing system see below. 

10 It is possible in the DDbDP to add introduction and line-by-line comments to any published pa- 
pyrus as well: cf. REGGIANI 2017, 241. 

11 The content of APIS has been integrated into Papyri.info. 

12 Thelayout of the edition may depend on browser settings. 
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ddbdp.2015.1 = HGV P.Got. 54 (DDbDP 2015 1) = Trismegistos 36154 = gothenburg.apis.45 = p.got.54 
metadata ert open in editor Canoneal URI: hip; //eapyrunfol44040/44240:2015:1 


MOV data Trans APG caos record weon mages? rena 


HGV: P.Got. 54 (DDbDP 2015 1) [source] [xml] 
Title A Letter from Athanasios the Scholastikos Mentioning Constantinople 
Publications P.Got. 54 (DDbDP 20151) [Moreinseries P.Got.] 
P.Got. 54 

Support/Dimensions Papyrus 
unbekannt [More from unbekannt] 
Papyrus 
2. Hälfte VI -641 [Mace from the period between s51 CE aad 641 CE] 
Neuedition bei PN: <http://www.papyri.info/ddbdp/p.got;;54dgtl/>. 
Brief; Athanasios, der Scholastikos, an N.N. 


LE © Heidelberger Gesamtverzeichnis der griechischen Papyrusurkunden Agyptens. This work is licensed under a 


Publications P. Got. 54 

Inv. no. ğ Ini 

Date AD 500 - 699 
Language Greek 

Provenance Egypt [found & written] 


Catalog Record: gothenburg.apis.45 [xml] 

Title Lettre 

Summary Christian letter 

Citations P. Got. 54 deser. [TM 36154] 

Inv. Id P. Got. inv. 38 

Support/Dimensions Pap.; 13x i cm 
Provenance unknown (Province of Egypt) [More fram Provenance uakaowa (Province of Egypt) 
Papyri Gothoburgenses were, with the assistance of prof. Wilhelm Schubart, acquired partly from a private dealer in Germany in 
Chronique d'Égypte 13 - 14, 1932, p. 327 
Greek 
VI-VII ALD. [More frosa the period between 500 CE and 699 CE] 

Note (general) Location: Gothenburg University Library 

Note (general) Source of description: Recto and verso 

Note (general) Recto: 7 lines against the fibres: Verso: 1 line with the fibres 


Fig. 1: The presentation of metadata. 


Introduction 
A Letter from Athanasios the Scholastikos Mentioning Constantinople 
Rodney Ast - Lajos Berkes 


P.Got. 54 («P.Got. inv. 38) was published in 1929 by H. Frisk as a descriptum without commentary or 
translation. The address on the back was not transcribed at all. We therefore offer here a full edition of 
the complete papyrus. 


What survives is the upper left part of a business letter measuring 13 (w) x 11 (h) cm. Writing runs 
against the fibers, transversa charta, on the front and along the fibers on the back. Given the fairly 
clean break on the right side of the document, we suppose that the papyrus broke along its central fold, 
meaning that ca. half of each line is lost on the right. This would give us a reconstructed width of 26 cm 
for the document, a reasonable height for the original roll from which the sheet came. Blank spaces are 
visible in several lines (see especially |. 5), apparently employed as punctuation. The script points to a 
date in the late 6th, cf. P.Heid, 3 247 (5 - 25 Nov. 598), or first half of the seventh century, cf. P.Oxy, 16 
1843 (623); the address on the back is in a stylized script also typical of the period. The mention of 
Constantinople in 1. 7 implies that the letter was written before the Arab conquest (641) and not under 
Persian occupation of Egypt (619-620). The papyrus’ provenance may well be the Arsinoite nome, but 


we suggest this only because the majority of the Gothenburg papyri come from the Fayum. There is 
nothing in the text itself that hints at where it was written. 


Two people are mentioned, a man called Hypatios (1. 6) and the sender, a certain scholastikos named 
Athanasios (cf. |. 8n.). The addressee, whose name is no longer extant, was a high-ranking person, gothenburg aps 451 * 
perhaps a dux, as suggested by the reference to him as éxcpoxt (cf. 3n). The high register of the word 4 
points to a context in the upper echelons of the Byzantine administration, and the involvement of a hase contact a hrean lay pe i ots any onn APIS or bo HR 
scholastikos fits this scenario. In preparing for his career, Athanasios will have received training in 
rhetoric and at least an elementary law education (see CPR 24 4.4n., with further bibliography). We : 
presume, therefore, that he was either a professional lawyer or a high-ranking administration official. HGV 36154 Translation (Englisch) [xmi] 
r 
In keeping with the high register of the document, there is a reference in line 7 to the city of 
Constantinople in the phrase xata rn v faoviba rav 1[óAcow. Designation of the city as ñ Paou c 
tav 160v is found in other documents, see, e.g., PS 1 76.5 (6 May 572 or 573) and P.Eirene 315.3 _ F The dispute about 
(2nd half of 6th c.), and there are numerous examples in late authors, as a search in TLG of xata. t v +. 0n the 20th of the month of Choiak to the .... 
Baouiba tav 1óAtuv demonstrates. Although he considered Bacui&a, H. Frisk opted in the ed.pr. to -. [your] eminence the judgments?) and . . . 
print kata tm v auiba rav v[; be abandoned the reading presumably because the first letter of the set in motion in sum... 
word does not look like a typical beta. Dieter Hagedorn, however, also proposed kata tn v faoviba to state; but since God has granted . . 
xXàv mò).ccov, stating in an email that "Beta hat in der Spitzeit oft ein abenteuerliches Aussehen." We 


Fig. 2: The layout of the edition: introduction, image, and translation. 
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Fig. 3: The layout of the edition: transcription, apparatus, and commentary. 


This structure preserves the traditional, well-established format of papyrological 
editions, but offers some distinct practical advantages. Scholarly work - even in the 
humanities — has been heavily relying on digital resources over the past decades: re- 
search is basically unthinkable without the extensive use of databases, journals and 
books available online. This is especially true for papyrology, since the field offers a 
wide range of excellent and indispensable databases for its practitioners. Further- 
more, engaging with the discipline often requires dealing with scattered information: 
where to find an image of the papyrus? Which textual corrections or interpretative 
suggestions have been made to the ancient texts one is working on? Finding this 
information is relatively easy in the very well organized field of Papyrology, but after 
finding the references, scholars often end up searching other databases for the 
referenced images, texts, articles, and books online.? 


13 For discussion of the methodological advantages and the development of papyrological digital 
resources, cf. REGGIANI 2017, passim. 
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The digital editions offered in the DDbDP clearly faciliate this process. The 
(downloadable and zoomable) image appears next to the papyrus, which renders 
checking the readings very easy.“ Metadata and links to the relevant entries of other 
important databases in the field can be found in the introduction, while an effort has 
been made to give direct links to all papyrological texts in the DDbDP and the 
referenced articles, when an online version is available — which is increasingly the 
case. The reader gets all the information, which he or she would otherwise collect 
from different sources, in one package. 


3 Technical challenges 


I have tried to demonstrate the advantages of digital editions by using the example of 
DDbDP 2015 1. However, it has to be said that producing these editions was a difficult 
undertaking often hampered by technical challenges. Sometimes the display does not 
show the result one would expect even though the XML markup is correct. There were 
numerous problems in laying out the texts, which often required ad hoc solutions. An 
example are raised letters which are often used to designate second editions. 
However, these cannot be used in the commentary or the introduction: they have to 
be represented in other ways. A more serious problem is the case of PDF files: one of 
the most important features, which the scholarly community would expect, is that 
online editions can be converted into downloadable PDF files, i.e. print versions. This 
way digital editions could be accessible in a more traditional, tangible form. It seems, 
however, that this is technically a much more complex issue than one would expect. 
There is unfortunately no easy way at present to transform the digital editions made 
at DDbDP into PDF files. 

There are also limitations on encoding those features in Leiden+/XML that were 
already difficult to handle during the digitization of print publications. It is very 
difficult, for example, to reproduce the layout of the papyri at present. This is not that 
visible in the case of papyri published in the DDbDP so far, but there would be certain 
cases (e.g. accounts with specific layout) that would be very difficult to reproduce. It 
is also impossible at the moment to represent abbreviations in the apparatus, as in 
printed editions. The abbreviations of the address in DDbDP 2015 1 (] t AGaváotog oDv 
6(£à) oyoA(aotixóc) dpetep(06) ) would have been represented in the apparatus of a 
printed edition in the following manner: o*vvoy'ouuetep pap. Whether one includes 


14The images are of course hosted by the Gothenburg University Library. The online publication of 
images not available on the instituional websites of their respective holders and copyright owners 
may be an additional challenge in other cases. 

15If the whole article was not available online, the relevant entry in the online Bibliographie 
Papyrologique via http://papyri.info/bibliosearch was linked. 
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this in the apparatus is very much up to editorial traditions and preferences; one may 
argue, in fact, that the presence of a high quality image of the papyrus renders 
detailed approximations of abbreviations superfluous. However, a digital edition 
could offer more in this case: it would be certainly possible to link the abbreviations 
in the text directly to the image. ! Even individual letters of the transcription could be 
matched with the image and thus online editions could become an excellent tool for 
self-study.” 

These technical issues have imposed some limitations on the editions published 
in the DDbDP, but there is no doubt that all these problems could be solved. If such 
digital editions are be accepted by papyrologists, it will be only a question of time and 
money to create an editorial platform at Papyri.info that can serve all the needs of 
traditional editions. The question is rather where the priorities of papyrologists lie, or 
in other words: is this worth the effort in a field with such limited resources? Another 
question may be asked at this point as well: should we expect online editions to 
conform to the norms of traditional printed editions or should we accept them as a 
slightly different form of publication? 


4 Referencing digital editions 


One of the most difficult problems we faced after creating the editions was their 
naming and referencing. There are two major issues here: 1. How should these 
editions be integrated in the standard reference system of papyrology, Checklist of 
Editions of Greek, Latin, Demotic, and Coptic Papyri, Ostraca, and Tablets (hereafter 
Checklist)?" 2. Since these texts are part of Papyri.info, their text, introduction and 
commentary can be emended and updated by users and these improvements may 
change the ‘original’ edition. This would imply that these editions have no stable 
form, but are on some level fluid.” This ‘fluidity’ obviously creates difficulties in ref- 
erencing: how can this problem be dealt with? 

Both issues have created significant discussion among editors of the DDbDP and 
the solutions are preliminary. In what follows, I will outline the main lines of thinking 


16 This method is being applied on literary papyri in the Anagnosis project: http://www.kallimachos. 
de/kallimachos/index.php/Anagnosis (see the chapter by R. Ast and H. Essler in the present volume). 
17 For the usefulness of online paleographies cf. PapPal (http://www.pappal.info/). For an online 
papyrological school see the online Arabic Papyrology School (http://www.apd.gwi.uni-muenchen. 
de:8080/aps/home/), which uses the method of matching letter forms with the transcription in order 
to introduce students to the paleography of Arabic papyri. 

18 http://papyri.info/docs/checklist. See REGGIANI 2017, 23 ff. and 29 ff. on the Checklist and biblio- 
graphical standards in Papyrology. 

19 Cf. REGGIANI 2017, 241. 
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behind the solutions which have been implemented so far. Since editors themselves 
disagree on some of these questions, it has been decided to wait for the reactions of 
the papyrological community and continue developing the form of the editions based 
on its reactions. 

The papyri which have been published in the DDbDP so far were all described 
before and had a corresponding reference in the Checklist. This made things easier in 
a way, since we could have opted for slightly modifying the already existing refer- 
ences of these papyri, but at the same time complicated issues even more, since we 
did not want to introduce anomalies in the Checklist. Furthermore, we also had to 
consider a reference system for papyri that did not have a Checklist-identifier yet, so 
that it could be used for further publications. 

At first sight, these publications represent the same case as papyri published in 
journals. They could have been referenced in a similar fashion and then included in 
the Sammelbuch griechischer Urkunden aus Agypten, as usual. However, an important 
argument came up quickly: why should we double the effort, if the texts printed in 
the Sammelbuch are reentered into the DDbDP again? This is especially true 
considering that it is inevitable in the long run that the Sammelbuch will be published 
digitally (only?). It was also important to indicate the date of publications, since this 
way these editions become comparable to journal publications. Taking all these 
considerations into account it was decided to use the identifier DDbDP year number, 
e.g. DDbDP 2015 1: this identifier represents a collection of digital-only publications.” 
The numbering follows the sequence in which the submitted publications were 
included in the DDbDP. This creates a clear and transparent system that enables 
straightforward referencing of editions born at Papyri.info. These editions are 
regularly announced through the Bulletin of Online Emendations to Papyri (BOEP)” in 
order to make their existence known to the papyrological community. 

The other issue is what I have called the ‘fluidity’ of these texts. Papyri.info ena- 
bles editing the text, introduction, and commentary - essentially any part of these 
digital editions. This leads to obvious problems in referencing them. For instance, let 
us say someone refers in an article published in 2018 to the introduction of DDbDP 
2015 1. However, in 2019 a user of the DDbDP proposes a change to the introduction 
of DDbDP 2015 1 that affects exactly the part of the text which was referred to in 2018. 
His or her suggestion is reviewed and accepted by the editorial board of the DDbDP 
and subsequently replaces the text referred to in the 2018 article. If someone checks 
the 2018 article in 2020 and wants to look up the reference, he or she will not find it. 
The information that the introduction has changed would be available in the editorial 


20 It has not been decided yet whether these publications will be included in the Sammelbuch or not. 
21 Edited by R. Ast, L. BERKES, and J. M.S. Cowey: http://www.uni-heidelberg.de/fakultaeten/ 
philosophie/zaw/papy/projekt/bulletin.html 
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history of DDbDP 2015 1, but this is not an obvious or user-friendly place to look for 
this information. 

This is a problem at the present, even if it is only a theoretical one, since no 
changes have been made to these editions so far. The editorial board of the DDbDP 
has debated how to deal with such scenarios, but there are basically two solutions. 
The more traditional way is to track the different versions of these editions. This 
means that in theory something like DDbDP 2015 1 (2), 1 (3) etc. would come into ex- 
istence each time the edition would change. In an ideal world the user would be able 
to switch between these versions with the differences being highlighted. However, 
this is impossible to do presently. 

Another way of thinking would be to accept these digital editions as a new form 
of scholarly publication that is not as stable as the traditional ones. This would imply 
that scholars would need to get used to the idea that the texts they find online can 
change anytime and that they need to check them each time before quoting them and 
always refer to them with a date of access. This approach has certainly some appeal, 
as it creates a faster, more direct way to do scholarly work, but there are also caveat-s. 
This method may lead to chaotic references and a certain lack of transparency. How- 
ever, we also have to accept that at some level this is already becoming the reality of 
scholarly work. Discussing drafts publicly on an online platform (e.g. at https:// 
www.academia.edu) has become increasingly common even in the humanities (this 
practice is much more widespread in the sciences). We all refer in papyrology to un- 
published documents, drafts, personal communications of colleagues: in the past, 
even communications “per litteram” were normally cited and accepted as scholarly 
references. These are in a way also messy references, since they cannot be easily 
checked or verified, especially since some of this information never becomes publicly 
available. If we want to fully exploit the possibilities of digital publications, we may 
need to accept this ‘fluidity’. 

Finally, I would like to mention a further minor problem in referencing the text: 
there is no traditional page numbering. In my view, however, this is not a real 
problem. First of all it is very common in Papyrology to refer only to introductions or 
commentaries of editions without mentioning the page number. But what is more 
important: references can be very quickly found in an online environment through 
the search function of the browser. Even though some references to online editions 
may seem vague at first, it is very easy to deal with them practically. 
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Editorial History; All History; (detailed) 


2016-03-10T10:16:41-05:00 [james.cowey]: Finalized - Ready. 


2015-12-16T09:23:00-05:00 [simoeis]: Finalized - Ready 

2015-12-16T09:19:11-05:00 [simoeis]: Vote - Accept-Straight-to-Finalization - Fine 
2015-12-15T10:55:21-05:00 [simoeis]: Submit - listBibl has been removed 
2015-06-11T08:55:16.459-04:00 [james.cowey]: Finalized - Ready. This edition has 
undergone the peer review process having been vetted by two senior editors. Their 
suggestions and corrections have been taken into account. 
2015-06-11T08:55:16.451-04:00 [james.cowey]: Vote - Accept-Straight-to- 
Finalization - Fine. 

2015-06-11T08:46:09.179-04:00 [james.cowey]: Resubmitting an edition prepared by 
R. Ast and L. Berkes, because problems with finalization had arisen 
2015-05-28T08:30:40.46-04:00 [berkes.lajos]: We are submitting a full edition of 
this papyrus, including revised text, HGV data, commentary, and translation. We have 
taken into account suggestions made by the readers. 


Fig. 4: The editorial history of DDbDP 2015 1. 


5 Perspectives of digital editions 


I have tried to demonstrate the main advantages and the accompanying problems of 
digital editions at Papyri.info. As I have emphasized: it is still an open question 
whether this format will be accepted and developed. If it is accepted, it would 
certainly represent a new, more fluid form of textual editions in our field. If we try to 
look at the bigger picture, this model offers some distinct advantages beside the prac- 
tical ones I have mentioned so far. Publication through this platform is open-access, 
peer-reviewed, and fast. Once an edition has been properly vetted, it can be published 
without further delay. This platform may also offer the possibility to quickly describe 
or publish smaller fragments that could swiftly enter DDbDP this way and thus be- 
come searchable. This of course should not imply that Papyri.info limits itself to the 
publication of small pieces that would otherwise be ignored. 

There are also some problems with this model. One issue is the appreciation of 
online, open-access publications in our field and the humanities more generally. 
Even though most scholars agree that such publications are desirable and represent 
the future, they are still not really valued. If someone were to decide to publish a pa- 
pyrus in a well-established journal or at DDbDP, it is pretty clear that the former pos- 
sibility is much better for one’s CV, even if submissions to the DDbDP go through a 
peer-review process. An additional problem in this respect is that while in an article 
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one can bundle several papyri together, this is not possible at present at Papyri.info.2 
Only time can tell how fast the attitude towards digital publications will change in 
our field. However, it is pretty likely that this will happen, as has been the case in the 
sciences for quite some time. 

Another problem is the issue of limited resources in our field. Papyri.info is indis- 
pensable at the moment, but is struggling to keep up with new publications, BL en- 
tries and other corrections, etc.” So the question is: what are our priorities? The focus 
of the editorial board of Papyri.info has been very pragmatic: making as many new 
texts as possible available. This has led to certain compromises. For instance, alter- 
native readings are often not entered during the entry stage and many (BL and other) 
corrections are still missing. We believe that it is better to have more material online 
(even with some infelicities) than to stick to a more limited, but also more flawless 
corpus. 

Focusing more on publishing papyri on Papyri.info would certainly also require 
an effort from the papyrological community: scholars would need to vet the incoming 
submissions and be open to treating editions at Papyri.info the same way as they 
would treat printed publications. The online publication does not have to stop with 
documentary texts; the existence of the Digital Corpus of Literary Papyri (DCLP) at 
http://www.litpap.info could also open the door to editing literary papyri using es- 
sentially the same platform.” Languages do not have to be a limit either: at present it 
is possible to publish Latin, Greek, and Coptic papyri at Papyri.info and the language 
of publication is not restricted to English.” The potential is certainly there, but prior- 
ities need to be set. 

I believe that the only way to find out whether Papyri.info could work as a plat- 
form for editing papyri is to give it more practice. We need to see whether this kind of 
edition and reference system works for Papyrology or not. It may turn out at the end 
that some (inevitable) chaos in referencing these texts is outweighed by the ad- 
vantages that this kind of direct scholarly work provides; on the other hand, it is also 
possible that papyrologists decide to stick with more traditional forms of publication 
or prefer other digital options. However, to assess this we would need more online 
publications. At moment, anybody can submit a papyrus for publication at Pa- 
pyri.info. It is hoped that this article will encourage scholars to explore the possibili- 
ties of editing papyri on this platform or at least to open a dialogue about its ad- 
vantages and disadvantages. 


22 On the other hand, producing a digital-only volume is certainly possible. 

23 However, as R. Ast pointed out to me, the situation was much worse in the late 1990s and early 
2000s: the difference is that users’ expectations are much higher now. 

24 Cf. REGGIANI 2017, 250 ff. and the chapter by R. Ast and H. Essler in the present volume. 

25 In fact, the editorial histories of certain submissions at Papyri.info often contain discussion in 
French, German, Italian or even other languages. 
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Massimo Magnani 


The Other Side of the River 


Digital Editions of Ancient Greek Texts Involving Papyrus Witnesses 


1 Introduction 


More than forty years ago, on the footsteps of Dom Froger (1968), WEST 1973, 70-2 
asked himself for which editorial task the use of the computer could have helped. 
After having discarded the automatic collation,! West imagined that a computer, pro- 
vided with the transcriptions of the manuscripts “purged of coincidental errors”, 
could have drawn up “a clumsy and unselective critical apparatus”. Then, if contam- 
inations have been out of question, this computer could have worked out “an ‘unor- 
iented’ stemma” by comparing the variants, but it could not have been able to choose 
the correct orientation of the stemma, an operation possible only “by evaluating the 
quality of the variants".? Thereafter, it has been assumed that the computer might be 
useful even for an heavily contaminated paradosis: the late Bryan Hainsworth, pre- 
senting in short the manuscript tradition of the Odyssey, imagined that a computer 
could establish the degree of contamination of every single family of the Odyssey’s 
paradosis, but in his opinion the result would not be commensurate with the amount 
of work.’ After years, what is the situation, with reference to the ancient Greek litera- 
ture and texts? If we apply to the term ‘edition’ the usual scholarly meaning, i.e. ‘edi- 
tion of a text based on the method(s) of textual criticism’, and we expect that the ex- 
pression ‘digital edition’ can refer, among the variety of the editorial products, not 
only to the more or less refined digitization of old or new ‘traditional’ editions of an- 
cient Greek texts,“ but also to the ‘edition of a text based on a new digital method of 
textual criticism’, we are destined to disappointment, at least for the time being, and 
not surprisingly. 


1 “Machines have not yet been devised which can cope with variations inherent in handwriting”, 
p. 71. Transkribus promises to find a solution to the problem of the automatic transcription (https:// 
transkribus.eu/Transkribus). For the automatic collation of transcriptions, see CollateX (https:// 
collatex.net/about), the successors of the well-known Peter Robinson’s Collate, the equally well- 
known Wilhelm Ott’s TUSTEP (http://www.tustep.uni-tuebingen.de/tustep_eng.html), and finally 
Juxta (http://www.juxtasoftware.org). West’s ‘traditional’ critical edition of the Odyssey was recently 
published posthumously (WEST 2017). 

2 WEST 1973, 72, and see the example on p. 71. 

3 HAINSWORTH 1982, xxxiv. 

4 Texts transmitted by multiple witnesses and editions produced via the traditional, post-Lachmann- 
ian textual criticism. 


8 Open Access. © 2018 Massimo Magnani, published by De Gruyter. [COBAAN] This work is licensed under the 
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. 
https://doi.org/10.1515/9783110547450-005 
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2 ‘Digital editions’ of classical texts: an overview 


The Catalogue of Digital Editions, the most complete “attempt to survey and identify 
best practice in the field of digital scholarly editing",? gathers 256 ‘digital editions’, 
among which 12 edition of ancient Greek texts (30.10.2017).° In fact, none of them is a 
new edition based on textual criticism applied to a multi-manuscript tradition: 11 edi- 
tions are catalogued as ‘digital scholarly editions’;’ six of them are editions of a text 
transmitted by unique witness (4 are editions of epigraphic collections,? two involve 
the text transmitted either by a single’ or by a peculiar and very valuable ancient man- 
uscript!°), two are the digitization of the standard, ‘traditional’ editions of the Old and 
New Testament in Greek language (ID 163 = LXX Septuaginta — http://septuaginta.net; 
73 = Digital Nestle-Aland — http://nestlealand.uni-muenster.de), one is the edition 
that gathers the electronic editions of the Gospel according to St. John in Greek, Latin, 
Syriac and Coptic (ID 58 = http://www.iohannes.com). Finally, I will not include the 
Digital Athenaeus among the digital scholarly editions, since it is the digitization of 


5 A project of P. Andorfer and Ks. Zaytseva of the Austrian Centre for Digital Humanities (ACDH, Vi- 
enna) and G. Franzini of the UCL Centre for Digital Humanities (UCLDH, London); for the data, see 
FRANZINI 2012-; for the web site, see FRANZINI — ANDORFER — ZAYTSEVA 2016-. 

6 See also the Catalogue of Scholarly Digital Editions, compiled by P. Sahle (http://www.digitale-edi- 
tion.de). Neither the DIG-ED-CAT nor Sahle’s Catalogue register the Homer Multitext Project (http:// 
www.homermultitext.org). On the subject of the digital scholarly editions, see in general PIERAZZO 
2015. 

7 DIG-ED-CAT ID 10, 24, 57, 58, 73, 86, 92, 93, 100, 163, 244. The ID 17, the edition of the Euripides 
scholia managed by D.J. Mastronarde (http://euripidesscholia.org/EurSchHome.html), started in 
2007 and currently updated in November 2017, is not included in the catalogue of the digital scholarly 
editions not because this edition is not digital, but because it is credited not to be a scholarly edition, 
term by which the DIG-ED-CAT project managers “mean editions with a strong critical component” 
(https://dig-ed-cat.acdh.oeaw.ac.at/faq). I am not able to understand why the Euripides scholia are 
not a ‘scholarly’ edition, but an edition like Sappho's poems (http://inamidst.com/stuff/sappho), “an 
attempt to collect Sappho’s entire work together in one page — with Greek originals, succinct [?] trans- 
lations, and commentary [in fact, absent]”, it is. Obviously, these catalogues cannot provide any guar- 
antee of completeness. 

8 ID 24 (IOSPE = Inscriptions of the Northern Black Sea - http://iospe.kcl.ac.uk/corpus/index.html), 
86 (IGCR = Inscriptiones Graecae in Croatia repertae — http://www.ffzg.unizg.hr/klafil/dokuwiki/ 
doku.php/z:epidoc-hrvatska), 92 (InsAph = Inscriptions of Aphrodisias Project — http://insaph. 
kcl.ac.uk/index.html), 93 (IRCyr = Inscriptions of Roman Cyrenaica - https://ircyr.kcl.ac.uk). For the 
s.c. “special types of edition”, i.e. the edition of papyri and inscriptions, see WEST 1973, 94-5. 

9 ID 57 (Derveni Papyrus - https://chs.harvard.edu/CHS/article/display/5418). 

10 ID 10 (Codex Sinaiticus — http://codexsinaiticus.org/en). The DIG-ED-CAT does not include a digi- 
tal scholarly edition very similar to the Codex Sinaiticus, that is Palamedes (PALimpsestorum Aetatis 
Mediae EDitiones Et Studia — http://www.palamedes.uni-goettingen.de), the edition of the palimpsest 
mss. Hierosol. Sancti Sepulcri 36 and Par. Gr. 1330. 
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Kaibel’s text (with tools).! The same situation has been recently underlined by Er- 
manno Malaspina with reference not only to the ancient Greek domain but also to the 
philological studies in general.” An exception, once finalized, should be his project 
of digital edition of the Ciceronian Lucullus (in collaboration with MeDiHum, Turin 
University, e IAS, Durham University): a ‘Lachmannian’ critical edition of a classical 
text transmitted by 74 mss. arranged in a closed recension, a sort of ‘crash test’ of the 
DH resources, and most of all of the TEI encoding.” The final result will be a web page 
with critical text and apparatus, with the possibility to open windows with the text of 
individual manuscripts or editions in correspondence with their variant readings of 
ecdotic significance. To mark a difference from the mere digitization of a ‘traditional’ 
critical edition, the apparatus of this Lucullus online aims to be much richer in data 
without yielding to the principles of genetic philology, but offering all that is relevant 
for the tradition of the text. In 2015 Malaspina performed a transcript from Word to 
XML through Oxygen of the readings of the 74 manuscripts and of some printed edi- 
tions limited to 4 text-paragraphs. By following the TEI guidelines (ch. 12), each vari- 
ant reading was tagged («rdg») referring through the @wit attribute to the list of the 
mss., and through one of the four @type attributes to its ecdotic classification (in 
order of increasing relevance: orthographical variants, polygenetic variants, variants 
significant for the constitution of the text, variants significant for establishing the 


11 ID 244 (http://digitalathenaeus.org). The Digital Athenaeus is a “work [...] focused on annotating 
quotations and text reuses in the Deipnosophists in order to [...] provide an inventory of authors and 
works cited by Athenaeus, and to implement a data model for identifying, analyzing, and citing 
uniquely instances of text reuse in the Deipnosophists”, where the Greek text is the digitization of the 
Teubner edition of KAIBEL 1887-90 without critical apparatus (see also BERTI — DANIELS — STRICKLAND 
— VINCENT-DOBBINS 2016; BERTI — BLACKWELL — DANIELS — STRICKLAND — VINCENT-DOBBINS 2016). S.D. Ol- 
son is at work, in order to produce a new, ‘traditional’ critical text of the Deipnosophists (the first 
volume of this edition will be published in 2018). 

12 Malaspina in MALASPINA — DELLA CALCE 2017, 58-9: “Con la formula ‘edizione digitale’ oggi si in- 
tende praticamente di tutto: riproduzioni di epigrafi, scansioni di brogliacci, collazioni di varianti e 
così via. Anche se si aggiunge l’aggettivo ‘critica’, che nella filologia classica tradizionale indiche- 
rebbe un prodotto ben definito, non si ottiene un quadro più omogeneo e soprattutto si vede spesso 
gabellato per ‘critico’ ciò che sarebbe più onesto definire ‘diplomatico’, ovvero la mera trascrizione di 
un testimone e/o la giustapposizione di varianti senza nemmeno porsi il problema della vera lectio”. 
With reference to the Romance studies, Rinoldi in BERNAGOU — PALUMBO — RINOLDI 2016, 41-4 under- 
lines some consequences of the increasing online diffusion of the ‘virtual manuscripts’, if compared 
to the lack of digital critical editions: even though the application of this approach should be ideally 
restricted to documentary texts, texts transmitted by a single ms., and particularly venerable mss., 
the alignment of reproductions and transcripts of mss. could promote the return of the bad Bédierism, 
that is the fetish of the bon manuscrit (see below). See PIERAZZO 2011 on the digital scholarly edition of 
documentary texts. 

13 Malaspina in MALASPINA — DELLA CALCE 2017, 58-62. 
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stemma codicum). The JavaScript prototype, performed by Peter Heslin (Diogenes’ in- 
ventor), allows for the data display and the control of the tagging errors; the last stage 
will be the synoptic display of the text of each witness. 

The lack of “‘comprehensively digital’ scholarly edition of a ‘classical’ text with a 
manuscript-based multi-testimonial tradition” was noticed by MONELLA 2012.! Accord- 
ing to the responses given to his cognitive survey on Academia.edu and Digital Classi- 
cist, the main reason should be “time and money”, but Monella believes that the real 
reason is the lack of need. Digital scholarly editions (DSE) are favourite scholarly prod- 
ucts of codicologists, epigraphists, papyrologists, editors of documentary manuscripts 
and palaeographers, says Monella, because they are focused on documents, and by 


‘genetic’ editors of modern and contemporary texts [...]° and historical linguists, who may study 
the evolution of language and orthography through ‘errors’ in inscriptions, in manuscripts and 
in modern print materials throughout the centuries, 


because their concern is the textual variance. Classical editors who are dealing with 
texts transmitted by a “manuscript-based multi-testimonial tradition” are dealing 
with ‘canonized’ ancient texts, where textual variance is due — or credited to be due 
— for the most part to the erroneous medieval paradosis and not to the author. ‘Errors’ 
are identified and collected only for establishing the stemma codicum and the text 
itself. Finally, continues Monella, for classical editors the manuscript is important es- 
pecially as “textual vehicle' and does not bear a particular relevance in itself. There- 
fore, e.g., why should the Aeneid's editor digitize even a limited part of this manu- 
script tradition? The purpose should not be the expansion of the critical apparatuses 
with the inclusion of more data - usually, this purpose is disregarded by the ‘tradi- 
tional’ editor because of their editorial irrelevance, but “a plural, fluid concept of text, 
a concept implying that each document's text is worth studying as a historically de- 
termined cultural object", and the increasing interest in “‘post-classical’ Latin and 
Greek - thus joining forces with historical linguists and romance philologists". In my 
opinion, it is not completely true that the ‘traditional’ classical editors and philolo- 
gists are only interested in manuscripts as witnesses of the text: on the contrary, we 
see an ever-growing attention to the ancient and medieval manuscripts as essential 
witnesses of the cultural reception of the related texts. It is acceptable that the digital 
scholarly editions can provide the best framework, in order to manage and display 


14 See also Tomasi in ITALIA — TOMASI 2014, 120, noticing the need of shared criteria such as verifia- 
bility of the sources, reliability of the institution promoting the edition, presence of curators, scientific 
layout, dates of creation and updating, absence of commercial interference (and, hopefully, absence 
of mistakes). In her opinion, to the absence of shared criteria, it is to be added that the DH are some- 
times perceived by philologists as a mere instrument useful for speeding up procedures rather than 
as a new way of understanding the edition, and that the interface often hides the methodological 
accuracy that has governed the process of creating the edition. 

15 See e.g. D'IORIO 2010; Italia in ITALIA — TOMASI 2014, 128-30, especially on the analytic genetic editions. 
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the rich data complex derived by a multi-manuscript paradosis in its entirety (see also 
below). Certainly, this approach can be of great cultural importance and can stimu- 
late the discipline to reconsider its goals and methodology;! moreover, the IT solu- 
tions devised in response to the problems associated with digital editions constitute 
an advancement and it will contribute to create new skills and new professional fig- 
ures, e.g. the ‘computational scholars’, that is “philologists skilled in both classical 
philology and computer science”. 


3 Case study 1: Mastronarde’s Scholia Euripidea 


Before trying a provisional conclusion, I would like to review with some detail two 
digital scholarly editions. Among the aforementioned editions, the only one that is 
really a new edition, even though based on the ‘traditional’ method(s) of textual crit- 
icism, are Mastronarde’s Scholia Euripidea (supra, n. 7), aiming to supersede the 
standard work of SCHWARTZ 1887-91.? As Mastronarde writes, “the goals of this pro- 
ject are quite traditional in a philological sense, but also experimental and forward- 
looking in terms of format”. This view is very instructive. On the one hand, Mastro- 
narde did not use the computer resources, in order to review the collations made by 
the previous editors,” to “clarify the extent, nature, and possible stemmatic relation- 
ships of the scholia in some of the so-called recentiores”, or to put a better order to 
the scholiastic corpora of the Palaeologan era (Maximus Planudes, Manuel Mos- 
chopoulos, Thomas Magister, Demetrius Triclinius). On the other hand, his choice for 
an open-access digital format responds to specified intellectual and educational 
needs, apart other very sensible economic, professional and scientific reasons: “a dig- 
ital format is variable [...], updatable, [...], allows for sharing of interim stages of the 


16 See MONELLA 2012, n. 35, with bibliography, reconsidering “the historical and anthropologi- 
cal/ethnological foundations of the discipline”. 

17 See e.g. BOSCHETTI 2009, 5. 

18 In the update of November 2017 Mastronarde announces the online publication of the edition of 
the scholia on Orestes 1-500 and, in the meantime, the forthcoming open-access publication of the 
Preliminary Studies on the Scholia to Euripides. The page about the 136 sigla used for Euripidean man- 
uscripts is new, with the possibility to download them as Excel spreadsheet (EurSiglaTable.html), 
together with an updated ‘Manuscripts page’ with additions. 

19 The description of the manuscripts is the only part that has been substantially updated (in 2016: 
https://euripidesscholia.org/EurSchMSS.html; in 2017: https://euripidesscholia.org/EurSchMSS 
new. html). Also, Mastronarde has been able to improve Schwartz's collations of M, B, and V; a 
greater progress has been made adding some lemmas and glosses of C, all the scholia in H (the Jeru- 
salem palimpsest, for which see supra, n. 10), ms. unknown to Schwartz, and those in O (Schwartz 
wrongly dated the ms. to 15" century, but it is now credited to be written ca. in 1175). Mastronarde 
does not include for the moment the ancient manuscripts with marginal and interlinear notations, for 
which see MCNAMEE 2007, 253-7. 
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work, [...] is expandable, [...] searchable in a way that a printed volume is not". The 
project is so far limited to the first 500 lines of Euripides’ Orestes and to 20 “witnesses 
of various kinds” of it; the play was selected by Mastronarde first because, 


as a triad play, it provides the maximum degree of variety and complexity in the annotation tradition 
and therefore forces one to confront most or all of the issues that may arise as the work proceeds”, 


then for the availability of images. The problems of conversion of a ‘traditional’ criti- 
cal edition to XML/TEI format have been so relevant, that only Orestes 1-25 and 401- 
25 have been published, with the Triclinian metrical scholia to the parodos and the 
prefatory material.” 

Mastronarde created four levels based on the TEI division-type element:” the 
divi element, the largest one, includes all the material related to the correspondent 
play, including one or two div2 elements, containing the introductory texts and the 
scholia. div3 always has three required attributes and occasionally an optional 
fourth one; the first two give a complete and expandable classification of the scholia 
(“@type = vet, rec, mosch, thom, tri, plan, vetMosch, vetThom, 
vetMoschThom, recMosch, recThom, recMoschThom, moschThom; @subtype 
= exeg, gloss, paraphr, gram, artGloss [“a gloss that consists only of the article 
agreeing with the glossed word”], etaGloss [“an eta placed over a Doric alphaina 
lyric passage to indicate the normal form”]”). The third one required attribute is the 
@xml-id of the play. The div4 elements are the “children of each scholion div3”, 
the only one of these being mandatory is the one including the text ofa single scholion 
with its lemma and its witness list (@t ype of ‘schText’). One of the main problems 
has been the impossibility “to key an apparatus item to a line number”, problem that 
can not be easily overcome, considering that “anchoring each apparatus item to a 
single word or phrase is possible, but the markup would be far too time-consuming 
and in my opinion out of proportion to any possible gain”. After the text of the 
scholion, “a required seg with Gt ype of ‘witnesses’ contains the sigla of the manu- 
scripts that contain the scholion”, then follow seven (or less) other kinds of div4: 
engTrans (only for a choice of scholia), lemmaPosNote (“details about variations 
in the lemma, the presence of reference symbols linking the scholion to the text, and 


20 Not always the XML method is accepted: Schmidt’s Ecdosis (SCHMIDT 2016, 100-1), a back-to-front 
development model providing a user-driven framework, aims to create, publish and share digital 
scholarly editions without using XML, seriously affected by “the problem of ‘markup variability’ - the 
tendency by different encoders to mark up the same features in different ways”. So, “instead of the 
linear text of XML, with embedded tags designed to apply abstract formats, Ecdosis uses a non-linear 
text and separate markup to describe textual properties, which are not arranged hierarchically, as in 
XML, but are allowed to overlap". For the TEI encoding limits, see CUFALO — MUGGITTU 2016, 89-91; in 
many of his contributions Schmidt underlines the structural inadequacy of the embedded general- 
ized markup for cultural heritage texts (see e.g. SCHMIDT 2010), and it is no coincidence that XML / TEI 
are not always adopted for these purposes. 

21 See at https://euripidesscholia.org/EurSchStructure.html. 
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the position of the note”), appCrit and appCrit2 (the second critical apparatus, 
“for orthographica and other minor matters”), and comment Sim (commentary and 
similia). Interesting is the creation of two tags systematically identifying two types of 
information, that in the ‘traditional’ editions are rare and scattered among the mss. 
readings: collNotes (“collation notes, record of difficulties in reading images, of 
divergences from previous reports, and reminders to check the original or a better 
image at some future date”) and keywords, in fact a reminder “for additional descrip- 
tion of the content in aid of searching or indexing at a later point”. Both of them are 
not publicly displayed, but reserved to the author and collaborators for future work. 
Another interesting piece ofinformation, again not always systematically recorded in 
the traditional critical apparatuses, is the location of the scholia (above the line, mar- 
ginal, or intermarginal), the variation of their sequence, and the indication of the 
point where a scholion begins and the other ends. Mastronarde’s choice is due to the 
difficulty of using superscripts in XML, therefore reserved to indicate different hands 
or “different versions of the same note at different locations in the same witness”. 
That Mastronarde’s edition is traditional and digital only for having chosen this for- 
mat of displaying the textual contents is also clear from the treatment of the metrical 
scholia, limited to Triclinius’ scholia on the parodos of Orestes. As Mastronarde’s un- 
derlines, by assigning a different tagging to the metrical scholia, XML allows to dis- 
play the metrical scheme and the text of Triclinius’ mss. side by side with the scholion, 
while DE FAVERI 2002 had to publish them separately, at the back of the book. 
Leaving aside, of course, the differences due to the progress of the studies on the 
manuscript tradition, a limited comparision, restricted to the incipit of the play (and 
to the vetera set of scholia), between Mastronarde’s edition (“full view” mode of dis- 
play) and the standard edition of Schwartz is instructive:” the digital edition shows 
five scholia vetera to Or. 1 (28 overall), all of which transmitted only by V (= A in 
Schwartz),” while Schwartz prints only three of them, with the first and the third 
‘tagged’ with a peculiar [dia]critical sign, a crux indicating not editorial desperation 
but the recent origins of those scholia” (all of them are defined as vetera exegetica in 
Mastronarde’s edition). The text of the three scholia in common does not differ; Mas- 
tronarde’s apparatus is richer, is preceded by the English translation and has addi- 
tional information (variations in the lemma, the presence of reference symbols, or- 
thographical matters, and a brief commentary). The convenience of the digital format 
depends on the aforementioned statement (variability, upgradability, accessibility of 
interim stages of the work, expandability, searchability), but this edition did not ben- 
efit of digital tools neither for the mss. collation nor for the arrangement of the ms. 


22 Seeat https://euripidesscholia.org/EurSiglaTable.html for the different sigla adopted by the mod- 
ern editors of scholia. 

23 Vatican, Biblioteca Apostolica Vaticana, gr. 909, ca. 1250-1280. 

24 SCHWARTZ 1887, viii. 
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witnesses in an open or closed stemma designed for every single scholiastic corpus.” 
Its apparatus has been created by Mastronarde and is not the result of the information 
recovery based on the computer assisted collation of digital transcriptions. 


4 Case study 2: the Homer Multitext Project 


Returning to the initial question, Harris’ warning” can be partially shared, but to my opin- 
ion the problem is not so much that the ordinary reader has no desire to loose him-/herself 
in the maze of variants that every philological operation inevitably generates, but that the 
concern about XML/TEI and its editorial application risks to make us lose sight of the final 
objective of every philological operation: that of establishing a text. On the other hand, I 
completely agree with him that if the application of the IT to the philology, precisely be- 
cause it allows multiple choices, is transformed into the abdication of choice, this appli- 
cation is improper, because the job of the philologist is to choose. In a sense, the most 
ambitious project of digital scholarly edition is the one that has chosen to completely em- 
brace that risk, not simply in order to avoid the choice but denying its methodological 
correctness. With the words of its editors, C. Dué and M. Ebbott, 


the Homer Multitext Project seeks to present the textual transmission of the Iliad and Odyssey in a his- 
torical framework. Such a framework is needed to account for the full reality of a complex medium of 
oral performance that underwent many changes over a long period of time. These changes, as reflected 
in the many texts of Homer, need to be understood in their many different historical contexts. The 
Homer Multitext provides ways to view these contexts both synchronically and diachronically.”” 


Therefore, the Homer Multitext Project offers “free access to a library of texts and im- 
ages, a machine-interface to that library and its indices, and tools to allow readers to 
discover and engage with the Homeric tradition”. Editors (and co-editors: D. Frame, 
L. Muellner, G. Nagy) reject the traditional approach for Iliad and Odyssey and the 
possibility of a critical edition of the poems establishing “an original text as it sup- 
posedly existed at the time and place of its origin".? Persuaded by the idea of a Ho- 
meric tradition always evolving from the pre-classical age to the Byzantine era, the 


25 Veteres and witnesses before ca. 1261, recentiores (witnesses after ca. 1261) containing old scholia, 
witnesses for Moschopulean scholia on the Triad plays, witnesses for Thoman scholia, witnesses for 
Triclinian scholia, witnesses for Miscellaneous Palaeologan scholia. 

26 HARRIS 2014, 84-5. 

27 http://www.homermultitext.org/about.html; see also DUE — EBBOTT 2010, 154: “unlike the stand- 
ard format of printed editions, which intend to offer a reconstruction of an original text as it suppos- 
edly existed at the time and place of its origin, the Homer Multitext offers the tools for discovering, 
viewing, and understanding a variety of texts as they existed in a variety of time and places”. 

28 See DUE - EBBOTT 2010, 153 and already DUE — EBBOTT 2009, 5: “textual criticism as practiced is 
predicated on selection and ‘correction’ as it creates the fiction of a singular text. The digital criticism 
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Homer Multitext editors believe that the textual variants even of the medieval para- 
dosis are the sign of the variation inherent to the system of oral poetry and that even 
when the poems were finally written down, they continued to be performed orally.” 
The consequence is that lliad and Odyssey have never been fixed texts. Therefore, the 
Homer Multitext Project is gradually editing the Homeric witnesses (papyri, medieval 
manuscripts, ancient quotations), considering each manuscript and each textual var- 
iation as the valuable testimony of the oral tradition. We are not told if all the manu- 
scripts will be digitized: six medieval Iliadic codices of well known relevance have 
been completely or partially digitized, not always successfully,” and only one papy- 
rus.” That is, a work in progress. The scientific foundation of it was placed by two 
‘traditional’ publications: “a multitext edition with essays and commentary” of Il. X 
and an overview of the Ptolemaic papyri as witnessing the multitextuality.? This is 
not the place where to discuss in detail the position of Dué, Ebbott and Gregory Nagy 
on the Homeric tradition; nevertheless, this position is the essential reason for their 
peculiar digital edition and therefore deserves a short presentation (and some con- 
siderations). Bird's very compact monograph aims to be a general introduction to tex- 
tual criticism applied first to classical and biblical texts (pp. 1-26), then to Homer in 
general (pp. 27-60), finally, to the Ptolemaic papyri of Homer (pp. 61-100), inter- 
preted as the evidence not of the *eccentricity"? but of the everlasting “multitextual- 
ity” of the ancient Homeric tradition. Bird defines as “authentic” and “original” all 


we are proposing for the Homer Multitext maintains the integrity of each witness to allow for contin- 
ual and dynamic comparison, better reflecting the multiplicity of the textual record and of the oral 
tradition”. 

29 The ‘fluidity’ of the Homeric text is in accordance with the general concept of the digital text; see 
BABEU 2011, ix: “As recently as a generation ago, the ‘text’ in classics was most often defined as a 
definitive edition, a printed artifact that was by nature static, usually edited by a single scholar, and 
representing a compilation and collation of several extant variations. Today, through the power and 
fluidity of digital tools, a text can mean something very different: there may be no canonical artifact, 
but instead a data set of its many variations, with none accorded primacy”. 

30 Venezia, Biblioteca Nazionale Marciana gr. Z. 454 (coll. 0822, the famous Venetus A); Venezia, 
Biblioteca Nazionale Marciana gr. Z. 453 (coll. 0821 = Venetus B); Venezia, Biblioteca Nazionale 
Marciana gr. Z. 458 (coll. 0841 = Allen U4); Escorial, Real Biblioteca fonds principal y. I. 01 (Andrés 
294 = Allen E3 = West E); Escorial, Real Biblioteca fonds principal Q. I. 12 (Andrés 513 = Allen E4 = 
West F); Genéve, Bibliothéque de Genéve fonds principal gr. 44. Only the Veneti A and B are com- 
pletely digitized; we have also a sample of the Genav., while the images of both Scor. are currently 
unavailable. 

31 The Bankes Papyrus (P.Lond.Lit. 28 = TM 60500 = M-P? 1013 = LDAB 1623 = Allen-Sutton-West 14); 
see also https://www.bl.uk/collection-items/the-bankes-homer. 

32 DUÉ - EBBOTT 2010; BIRD 2010. For the following presentation of these books I am partially in- 
debted with the dissertation of MARTINI 2013, 32-45. I take this opportunity to remember the very fruit- 
ful collaboration with Isabella Andorlini in the supervision of Martini’s work. 

33 E.g. WEST 1967. 
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the variant readings that seem Homeric both by nature and by ancestry, but this judg- 
ment will not automatically define the other variants as “spurious” or “unoriginal”: 
in others words, any variant reading that is not an obvious copyist's intervention 
must be considered authentic in principle. Synthetically, when two variants have in- 
ternal and external factors on their side that prove or suggest authenticity, the Homer 
editor must refuse the traditional way of thinking of the philologist that would lead 
to make a choice between the two: “there is no need to choose one reading and reject 
the other”. This way of operating is programmatically opposed to the Lachmannian 
critical setting, also from the point of view of data presentation (i.e. a traditional crit- 
ical edition that provides a main text at the top of the page, accompanied by a presen- 
tation of variant readings in the critical apparatus at the end of the page). For this 
reason, Bird proposed a new layout presenting the variant readings at the same time 
without resorting to the critical apparatus. The purpose is to print multiple versions 
of the text in parallel, creating a “multitext edition of Homer, one that would be ex- 
pected not only to report variant readings but also to relate them as possible to differ- 
ent periods of history”. The weakness of this position is, in my opinion, the un- 
proven equivalence of the aedic phase with the rhapsodic and then the Ptolemaic 
stage of the Homeric tradition — apart from a not entirely convincing and forced anal- 
ogy between the Homeric tradition and that of the New Testament. Not all the variant 
readings have a certain tradition going back to the classical age as the ‘Zenodotean’ 
(and Aeschylean) gaîta in Il. I 5.6 Ptolemaic papyri seem rather to hand down rhap- 
sodic variants,” such as those transmitted from the medieval text of the Homeric 
Hymns. An informative example of the erroneous Bird's approach is his discussion? 
concerning the “minor textual variants” attested for Il. VI 287—8 in P.Sorb. inv. 2302 
(TM 61240 = M-P? 786.1 = LDAB 2380 = Allen-Sutton-West p480a): in his opinion, 
QóÀicccoy katå and 0&Aopoy xateBncato, rightly defined at first as “clear examples of 
spelling reflecting pronunciation",? “convey the memory of a live performance, with 
all its speed and dramatic intensity”; this spelling, continues Bird, “presumably 
would not happen if the lines were being dictated slowly and carefully”. To affirm 
this, the phenomenon should be limited to the Homeric papyri, but it is notoriously 


34 BIRD 2010, 34-6; see also DUE — EBBOTT 2010, 8: “where different written versions record different 
words, but each phrase or line is metrically and contextually sound, we must not necessarily consider 
one ‘correct’ or ‘Homer’ and the other a ‘mistake’ or an ‘interpolation’”. 

35 NAGY 1996, 113. Already DI LUZIO 1969, 151 proposed to insert in the margin of the critical text the 
equivalent variations obtained from the papyri, so that the reader could choose the lesson according 
to his ‘taste’; and the pleasure of choice would not be reserved only for the learned editor. 

36 See BIRD 2010, 34-7. 

37 Itis the case, probably, of Il. VIII 526, discussed by BIRD 2010, 57-8, but see especially his n. 151. 
38 BIRD 2010, 92-6. 

39 BIRD 2010, 95. 
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widespread in texts of various nature, literary as well as documentary. A similar mis- 
interpretation concerns the spelling top pa for tóv pa in Il. XVII 578, attested in 
P.Sorb. inv. 2303 (TM 61117 = M-P? 948.2 = LDAB 2255 = Allen-Sutton-West p501c).^? 
Multitextuality has been also applied by DuÉ — EBBOTT 2010 to the tenth book of 
the Iliad:* the two scholars, in order to avoid presenting a critical text “that obscures 
the multiformity of the oral tradition”, choose to print four witnesses that represent 
the state of the text in different historical periods (a papyrus of the II century BC, one 
of the III AD, one of the VI AD, and the Venetus A). The ‘critical’ text is followed by a 
commentary, which, although not inclusive of all the information of a critical appa- 
ratus, in the author's intentions makes it possible to better explore the differences 
between the witnesses as it is more focused on the chosen texts. In fact, this edition 
is the paper version ofthe digital project, and, as far as the paper edition is concerned, 
there are many reasons for perplexity: the main reason is that, although the authors 
wish ideally to reproduce any text that presents significant variants, they are forced, 
by necessity of space and time, to make a choice regarding the number of witnesses 
to be presented, and the difficulty increases due to the deliberate omission of the crit- 
ical apparatus. The specimina printed in the edition are certainly analysed in detail 
in the commentary and are cited together with other relevant witnesses, but, as ad- 
mitted by the editors themselves, without reaching the level of completeness and sys- 
tematicity that is instead of a critical apparatus. Moreover, such an edition is certainly 
no longer easily consultable with respect to a ‘traditional’ edition, above all because 
of the inconvenience of having to resort each time to the comment, separate from the 
text. The other disadvantages are: no translation is available, if not a few hints in the 
commentary (not even for the papyrus texts), a limited number of witnesses, no sup- 
plements for the papyrus' lacunae. Therefore, for this work the definition of 'critical 
edition' is not appropriate, given that the Dué and Ebbott reject the traditional oper- 
ation, as above anticipated (“we want to avoid presenting a critical text that obscures 
the multiformity of the oral tradition"). They believe that it is not the apparatus but 
the commentary that makes the edition ‘critical’, “but critical in a different way from 
what is usually indicated by the term”.% As regards their definition of ‘textual criti- 
cism', they consider that it is authentically exercised by not judging which text is right 
or wrong, but rather “to criticize what these texts contain in terms of the textual tra- 
dition and the oral tradition that preceded it"; in other words, one can only distin- 
guish what is a trivial scribal error from a genuine oral variation.? This unusual way 
of conceiving the critical edition, however, is the basis of an editorial product that, in 


40 See BIRD 2010, 96-100. 

41 See especially ch. 4, Iliad 10. A Multitextual Approach (pp. 153-66), and parts II and III (Texts and 
Commentary, pp. 169-382). 

42 DUÉ — EBBOTT 2010, 153. 

43 DUÉ — EBBOTT 2009, 7: “our textual criticism of Homeric epic, then, needs to distinguish what may 
genuinely be copying mistake and what are performance multiforms". 
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addition to being extremely uncomfortable in the consultation, in my opinion fails 
precisely in its primary purpose, which is to allow ‘independent use’ by the reader of 
these versions. The editors justify their unconventional editorial choice — which in 
fact is a ‘non-editorial’ choice, as they suspend any judgment on the value of the var- 
iant readings — claiming to join “to the goals of the Homer Multitext” Project, that is 
the realization of a digital scholarly edition of Homer. 

The editors did not hesitate to describe the project as a revolution. The Homer 
Multitext Project stems from the belief that it cannot in any way apply the traditional 
editing system to works such as the Iliad and the Odyssey, because they originated 
from a long oral tradition without the aid of writing: 


in such a tradition in which the composition is occurring in the course of performance, there is 
no one “author of the original composition” to try to recover, for there is not only one composi- 
tion, but also no other author. 


It is the oral nature of the so-called Homeric poems that renders traditional editorial 
methods ineffective: the textual criticism that must be applied to them should not 
move in the direction of a ‘paradigmatic’ (or ‘canonic’) critical text, but should main- 
tain “the integrity of each witness to allow for continual and dynamic comparison”. 
According to the editors’ view, another traditional concept that must be abandoned 
is also that of “variant (reading)”, because it implies a judgment on the quality of the 
variants presented by the manuscripts: we should therefore adopt the more neutral 
term “multiform (reading)”. The logical consequence of this statement is that, if each 
variant reading is potentially ‘authentic’, the traditional critical apparatus no longer 
has any reason to exist: an approach of this kind “deliberately puts some central con- 
cepts and issues of conventional textual scholarship in crisis. The basic text, the text, 
the textual apparatus, and the variant”. Dué and Ebbott state with very explicit and 
categorical words: “the digital Multitext must be fundamentally different from these 
print editions in conception, structure and interface” .‘ One point that is very relevant 
in the Project is the need to present the digitized text of all the witnesses but without 
the critical apparatus. Despite the conspicuous attention attracted by the Homer Mul- 
titext Project, the impression is that, beneath the statements of principle, there is cur- 
rently very little concrete to compare with. In other words, there is a contrast between 
the limpidity and the firm certainty with which the Project guidelines have been pre- 
sented and the indeterminacy with which practical questions are dealt with: e.g., how 
the texts of the manuscripts will be presented in the absence of a text-reference base? 
Which tool will be used to replace the traditional critical apparatus? In the section 
dedicated to the description of the project on the website of the Homer Multitext Project 


44 DUE — EBBOTT 2009, 5. 
45 VANHOUTTE 2007, 165-6. 
46 DUE — EBBOTT 2009, 2. 
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it still remains said that “unlike printed editions [...] the Homer Multitext offers the tools 
for reconstructing a variety of times and places”, without specifying what tools are in- 
volved and how they work. Moreover, the absence of precise guidelines on the structure 
of the Project had been admitted also by Dué and Ebbott," who however had set aside 
the issue, considering it a detail (“but no matter what the details end up being, we have 
committed to three foundational principles: collaboration, open access, and interoper- 
ability").^ It must have been this vagueness in structural terms, together with a certain 
insistence on the importance of leaving the final choice to the reader, to have aroused 
the lucid judgment of M.L. West: *Nagy seems to think that an editor should simply 
marshal the evidence in a non-committal way". While Nagy seems to assume that the 
Homeric editor's most important task is to suspend his own judgment to prevent it from 
undermining his reader's, West defends the right to actively exercise his own, conclud- 
ing that they evidently have “very different concepts of the editor's role".? In Dué’s and 
Ebbott's edition, the text of the 10% book of the Iliad becomes therefore four different 
Iliadic texts, three ancient and partial,” the fourth, medieval and complete (Venetus A), 
each witness of the Homeric perennial multitextuality. Despite the absolute peculiarity 
of the Homeric tradition, a methodology of this type leads not only to the renunciation 
of the critical text, as said, but also to the renunciation of a critical approach tout court: 
itis revealing the treatment of the variant (or better, multiform) readings, each of which 
seems a priori to testify to an aedic (re)composition-in-performance, even though the 
editors tend to overlook their possible, often probable un-aedic origin (rhapsodic vari- 
ants, glosses, conjectures).* 


5 Conclusions 


As in many other scientific fields, the implementation of IT systems can lead to a 
methodological renewal only after a dissemination of their scientific use and only af- 
ter having produced results superior to those obtained by past philologists with the 
traditional methods. This scenario could be achieved through better quantitative and 


47 See also DUE — EBBOTT 2009, 33: “as we continue to build up our collection of texts, there are still 
questions to be answered about how to construct the architecture to achieve the visual representation 
we envision and that will achieve the results we have described here”. 

48 Three magic words in the digital editing. Their relevance is out of the question, but often editors 
do not go beyond the simple enunciation. 

49 WEST 2001, 160 n. 5. 

50 P.Mich. inv. 6972 (TM 61210 = M-P? 864.1 = LDAB 2350 = Allen-Sutton-West p609); P.Berol. inv. 
11911 + 17038 + 17048 + 21155 (TM 60757 = M-P? 855.1 = LDAB 1883 = Allen-Sutton-West p425 + 430); 
P.Cair.Masp. inv. 67172-4 + P.Berol. inv. 10570 + P.Strasb. inv. G 1654 + P.Rein. II 20 (TM 61072 = M-P? 
658 = LDAB 2209 = Allen-Sutton-West p46). 

51 So, e.g., the commentary to Il. X 10 (p. 247). 
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qualitative information processing, especially with reference to the collation of the 
manuscripts and to the stemmatic contamination. The availability of IT tools will in- 
creasingly allow to broaden and better evaluate the data of the manuscript tradition 
and could avoid an excessive limitation of the witnesses only for the need to reduce 
their number. The greater ‘capacity’ of the computer systems certainly allows a simpler 
and more efficient collection of the variant readings, sometimes useless for the consti- 
tution of the text but often valuable to study the textual tradition in a wider cultural 
sense. Regardless of the case of the Homer Multitext Project, there is however the sensa- 
tion of an overestimation of those variant readings not really significant for the consti- 
tution of the text, overestimation which is equally pernicious compared to their scarce 
eight-twentieth-century consideration. Sometimes, this ‘hypervalutation’ seems to be 
promoted by some digital editors after having observed the limited or null ecdotic pro- 
gress of the scholarly work. There are undeniable advantages in the digital scholarly 
editions, that have been clearly illustrated, for example, by Mastronarde (see above), 
but it is also true, and obvious, that the ecdotic improvement derived from a more care- 
ful study of the paradosis is not necessarily produced by computer tools.” There will 
certainly be a moment when the critical editor will be supported or even replaced by the 
AI, but the task of choosing the best possible text in a methodologically correct manner 
cannot but remain the essential purpose of the philological activity for most of the man- 
uscript traditions handed down by a plurality of manuscripts. 
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Part 2: Linguistic Perspectives 


Marja Vierros 
Linguistic Annotation of the Digital 
Papyrological Corpus: Sematia 


1 Introduction: Why to annotate papyri linguistically? 


Linguists who study historical languages usually find the methods of corpus linguis- 
tics exceptionally helpful. When the intuitions of native speakers are lacking, as is 
the case for historical languages, the corpora provide researchers with materials that 
replaces the intuitions on which the researchers of modern languages can rely. Using 
large corpora and computers to count and retrieve information also provides empiri- 
cal back-up from actual language usage. In the case of ancient Greek, the corpus of 
literary texts (e.g. Thesaurus Linguae Graecae or the Greek and Roman Collection in 
the Perseus Digital Library) gives information on the Greek language as it was used in 
lyric poetry, epic, drama, and prose writing; all these literary genres had some artistic 
aims and therefore do not always describe language as it was used in normal commu- 
nication. Ancient written texts rarely reflect the everyday language use, let alone 
speech. However, the corpus of documentary papyri gets close. The writers of the pa- 
pyri vary between professionally trained scribes and some individuals who had only 
rudimentary writing skills. The text types also vary from official decrees and orders to 
small notes and receipts. What they have in common, though, is that they have been 
written for a specific, current need instead of trying to impress a specific audience. 
Documentary papyri represent everyday texts, utilitarian prose,’ and in that respect, 
they provide us a very valuable source of language actually used by common people 
in everyday circumstances. 

This significant text corpus is openly available to us in digital form. The Papyro- 
logical Navigator (PN)? hosts the Duke Databank of Documentary Papyri and provides 
a search engine as well. However, any deeper linguistic research cannot be per- 
formed. The search engine at PN is mainly designed for the needs of historians and 
editors of papyrus texts in locating parallels and sources using word-string searches. 
In order to utilize the text corpus linguistically, it needs to be enriched with linguistic 
information, i.e. it needs to be linguistically annotated? Linguistic annotation can 
concern many different levels of language, usually morphology, syntax, semantics, 


1 Cf. WAGNER — OUTHWAITE — BEINHOFF 2013, 4. 

2 http://papyri.info/. 

3 A very clear textbook on linguistic annotation and corpus linguistics in general is KUBLER — ZINMEIS- 
TER 2015. See also e.g. WYNNE 2005 on developing linguistic corpora. 


8 Open Access. © 2018 Marja Vierros, published by De Gruyter. [COBAAN] This work is licensed under the 
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. 
https://doi.org/10.1515/9783110547450-006 
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or pragmatics. Even the basic morphological annotation alone can provide for com- 
plex linguistic queries. The literary Greek corpus has recently been automatically 
lemmatized and morphologically parsed.* Greek that is found in papyri deserves to 
be similarly treated so that the literary language can be compared with the utilitarian 
prose found in papyri, enabling our views on historical developments and variation 
of Greek language to be as full as they can be. 

In this paper, I will discuss the criteria and approach which I have chosen while 
planning the Sematia corpus and platform.’ While this is an ongoing process and 
plans often are subject to change, it is still worthwhile to explain what lies behind the 
selected approach, what the future plans are and possible new directions and, finally, 
what can be achieved with all this work. 


2 Corpus design 


One key factor in corpus design generally is that the corpus is representative. Whether 
we want a holistic or strictly selected corpus, depends on the research questions for 
which the corpus is meant to provide answers. If we want answers from a certain do- 
main of texts (e.g. private letters), we select only those texts into the corpus. Similarly, 
whether we want a synchronic or diachronic corpus depends on whether we want to 
examine changes in language used within a certain time span or not. In historical 
linguistics, corpora are generally diachronic. 

The papyrological corpus in PN is a growing and a changing one. It includes all 
published documentary papyri, and the Greek material ranges approximately from 
the IV century BC to the IX century AD. Newly published texts are added into the da- 
tabase by the academic community of papyrologists via the online Papyrological Edi- 
tor (PE), where a board checks and votes on the submissions. Also, mistakes (typos 
or wrong readings etc.) in the texts that already exist in the corpus, can be corrected 
via the same Editor. This is one reason for the idea that Sematia should also be kept 
open-ended, so that ideally it could include the whole corpus, which represents the 
Greek used in documentary papyri for a period of about a thousand years. Thus, at 
the moment, the corpus design is a loose one, but users (both the annotators and the 
researchers who only wish to perform queries) can decide on a case-by-case basis 
what they want to annotate or include in their searches. Once a version of a text has 
been annotated, that annotation is stable, but if the system alarms us that there has 
been a change introduced into the base text in the PN, the annotator (or someone else, 


4 CELANO 2017. 

5 https://sematia.hum.helsinki.fi. I warmly thank the developer, Erik Henriksson, for all his ideas and 
efforts. 

6 SOSIN 2010, cf. also REGGIANI 2017, 232-40. 
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for that matter) can renew the annotation on that text, if it seems warranted.’ The 
process of getting texts annotated is slow at the moment, since it is performed semi- 
automatically (more on this aspect below). The choice of texts to be annotated is not 
authoritatively dictated by us; the choice is made by the users, so anyone wanting to 
have a specifically chosen set of material, can proceed in annotating the papyri. This 
way s/he also makes a contribution towards the annotation of the whole corpus. And 
when there are more texts already annotated, each researcher mays select his/her own 
subcorpus and perform queries only on them (either in the Sematia platform or after 
downloadins all the selected annotations for external use). The latter option makes 
the research more easily replicable (a basic requirement in corpus linguistic re- 
search). 

Corpus design also includes deciding over the level of annotation and what fea- 
tures are annotated and how. At the moment, our basic approach is to include the 
morphological and syntactic annotation in the form of dependency treebanks. We fol- 
low the Ancient Greek and Latin Dependency Treebank system.? Sematia is designed 
to provide a ‘basic’ level of annotation, because we have this holistic idea ofthe whole 
corpus eventually being annotated; the research questions must not in this case be 
strictly decided beforehand. However, since the automatic morphological parsing has 
been performed on literary texts as mentioned above, this is a logical next step for the 
whole papyrological text corpus as well. This, in turn, would make the manual syn- 
tactic treebanking somewhat quicker, as the morphological forms would be more ac- 
curate than they are now to begin with (on the process of annotation, more detailed 
description below). 


3 Howto annotate papyri? 


Why should we devote a section on how to linguistically annotate papyrus texts? Be- 
cause the papyri represent ancient textual material often preserved in a fragmentary 
condition. The organic writing material has suffered damage of many kinds. But, due 
to the importance of papyri as a source, papyrologists work very hard on reading, 
transcribing, and reconstructing them, i.e. editing the text, so that other researchers 
can also use that source. Still, many gaps and question marks can remain in the edi- 
tions. All this is encoded within the text in the digital edition, in TEI EpiDoc XML,’ 
and for this reason we do not have simple access to the raw text that could simply be 
uploaded for some linguistic annotation tool. In fact, the editorial work gives us 


7 This type of alarm system has not yet been established, but it is on our agenda. 
8 https://perseusdl.github.io/treebank data; BAMMAN — CRANE 2011. 
9 https://sourceforge.net/p/epidoc/wiki/Home. 
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plenty of material that we can and should also use in the linguistically annotated cor- 
pus. Therefore, we need to preprocess the texts available in the PN in a certain way. 


3.1 Preprocessing 


The Sematia tool was first developed mainly for the above-mentioned preprocessing 
need. It creates two parallel layers of the same text; one being a sort of diplomatic 
edition (called “original”), and the other including the editorial suggestions (called 
*standard"). The tool has already been described in another article,” thus I will not 
present the details here. What makes the Sematia corpus special, is that both of these 
parallel layers are linguistically annotated. This way it is possible to study only the 
version that has truly been preserved for us (the original layer), or to compare the 
actual preserved text with its standardized version. The differences in this compari- 
son can be turned into a third layer (called “variation”), which I will briefly discuss 
later. 


3.2 Annotation 


In order for this corpus to be beneficial for all Greek linguists, I decided that we should 
follow the same scheme and standard used in the corpora of ancient Greek. This 
means the Ancient Greek and Latin Dependency Treebank that includes Greek litera- 
ture. In addition, the PROIEL treebank (New Testament and some Greek prose) fol- 
lows the Dependency Grammar." In the annotation of papyri, we follow the Guide- 
lines of AGDT.? At the moment, we use the external annotation environment, 
Arethusa, provided by the Perseids Platform,” with which we have an API integration 
in Sematia. This means that a text can be exported directly from Sematia to Perseids 
and Arethusa, and after it has been annotated, a member of the Sematia board (at the 
moment the project director) goes through the annotations in Perseids and either ac- 
cepts or returns them to the annotator to be corrected. After the approval, the tree- 
banks are committed back to Sematia (both the GitHub repository and the online site). 

The process of annotation in Arethusa includes the tokenization; i.e. tokenization 
is done when the plain text is imported into Arethusa, not into Sematia. The text re- 
ceives an automatic lemmatization and morphological tagging (by Morpheus). But all 


10 VIERROS — HENRIKSSON 2017. 

11 Both treebanks have also been modified for the Universal Dependencies site (http://universalde- 
pendencies.org), where they can be accessed together with many other languages. 

12 Version 1.1: BAMMAN — CRANE 2008, version 2.0: CELANO 2014. Version 2.0 is to be followed, but 
version 1.1 has sometimes more useful examples and more detailed explanations. 

13 http://sites.tufts.edu/perseids. 
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the lemmas and morphological tags need to be checked and corrected by the human 
annotator; there are several forms and lemmas in the papyri, which Morpheus, being 
designed for classical Greek, does not recognize, for example the Egyptian names. 
Moreover, Morpheus does not do wellin selecting the correct form from several hom- 
onyms. The syntactic annotation and dependencies have to be performed manually 
by the annotator. In other words, using Arethusa is convenient up to a point; but it is 
also quite laborious and thus expensive as it needs human resources: skilled annota- 
tors and their time. Nevertheless, in the end, we do get accurate annotations that can 
most likely be used in training automatic syntactic and morphological parsers in the 
future. 

The process can be presented by an example with images. Our sample sentence 
is thesecond sentence ofa letter from Petenephotes to Valerius, written on a potsherd 
in the garrison of Mons Claudianus in the Eastern Desert (O.Claud. II 245,2-7; mid II 
century AD): 


[1] [xaA@g] P rurjoic, &edge_, &à[v ZA0n] |* ñ nope TA vukri rait \néppag por / |° tpia Ceoyn 
dptwv éri ook &|5yo Gptous kai tav ZAGN ñ nof pha néppw ov abtà. 


3.1. tomoer 4.1. mopeia 5-6. élyw — 6-7.l.mo|peta 7.1. oot 


Please, brother, if the caravan arrives tonight, send me three pairs of bread as I do not have any 
bread and when the caravan arrives I shall send them to you. 


Note that the apparatus has several corrections (standardizations), but not for the t/et 
confusion in the conjunction éri (l. émei), 1. 5. This is the standard practice in this 
edition. Other so-called orthographic mistakes are usually standardized in the appa- 
ratus, but not the most common one between 1 / £t, because the editors apparently 
consider this such a common, parallel variant that it can no longer be considered as 
a ‘mistake’ (see also the chapter by J. Stolk in this volume for problems that this type 
of fluidity between editorial corrections can cause). 

The standard and the original layers of this sentence in the Arethusa treebank 
tool are presented in Figg. 1 and 2. Only the syntactic trees can be seen in the screen- 
shots, and only one lemma/morphological analysis (that of the highlighted word), in 
this case the conjunction mentioned above. This is emphasized here, because an au- 
tomatic parser would automatically take this word as the preposition éri, but when 
the human annotator checks the sentence, s/he notices that the preposition is not the 
correct interpretation, and can make the necessary correction, even though the word 
is not editorially corrected in the original electronic source of ours, in the PN. 

The differences between the layers are apparent in the images; the supplied text, 
for example, is not annotated in the original layer, it is represented with a dummy 
marker SU so that the annotator notices that something is missing there and the sup- 
plemented word does not end up in the corpus of original layers. This also leaves 
some of the branches of the sentence tree hanging in the air, as some words that 
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would be the heads on which other words depend on, are not preserved in the papy- 
rus. The non-standard orthography in the original will not prevent the annotator from 
recognizing and marking correct lemmata for the forms, thus lemma searches will 
find all variant spellings of the words from the original layers. 


AretHusa EX tenga narrata um retrai Sn S3 


SU nua , boc. t6SU SU ` oox: ` vari tap nàyeoc por rio roy otu (Bf c< tro &prouc nai 
row Un nopho tu ov abitò 


*^oo0.0-— 


Fig. 1: Original layer of the sentence [1] in Arethusa. 


ARETHUSA EENEEEBEBN S e 


xnMac norione , bapa , dev Ein ^ nopeia `“ voxci raim någor por tpia Gedy ápnov Bl ode Ep 
prouc «oi Grav EM | mopeia nézajas oo OÜTÓ 


“°°. 
tni? 


Fig. 2: Standard layer of the sentence [1] in Arethusa. 
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The underlying XML forms show us what the whole annotation entails. Fig. 3 presents 
the XML of the original layer’s annotation of the same sentence [1]. 


«sentence document_id= H .hum. . 
«word form="SU" head="" id="1" lemmo="" postag="" relation="" /> 
«word form="nurjoic” head="0" id="2" lemma="nowéw" postag="v2sfia---" relation="PRED" /» 
«word form="," head="4" id="3" lemma="puncl" postag="u " relatione"AuxX" /> 
«word forme"Gbekpe” heade"2" ide"4" lemma="&8£X@6c" postag="n-s---mv-" relotione"ExD" /> 
«word forms"," head="4" ide"S" lemmos"puncl" postag="u ” relatione"AuxX" /> 
«word forme"tàSU” heode"" ide"6" lemma="" postag="" relation="" /> 
«word form="SU" head="" id="7" lemma="" postag="" relation="" /> 
«word form="i" head="9" id="8" lemma="ò" postag="l-s---fn-" relation="ATR” /> 
«word forme"mopft" heade"" id="9" Lemmoe"nopeia” postag="n-s---fn-" relatione"" /> 
«word forme"Ti" head="11" id="10" lemma="ò" postag="1-s---fd-* relation="ATR” /> 
«word form="vuxtì” head="" id="11" lemma="véE" postage"n-s---fd-" relation="" /> 
«word forme"raüT" head="11" id="12" lemma»"olroc" postag="p-s---fd-" relatione"ATR" /> 
«word forme "néupac” head="2" ide"13" lemmo="néynw" postag="v-sapamn-" relation="ADV" /» 
«word form="por” head="13" ide"14" lemmo="iyWw" postag="pls---md-" relation="0BJ" /> 
«word form="tpia” head="16" id="15" lemmo="tpia" postag="n-p---na-" relation="ATR" /> 
«word form="CeUyn" heade"13" id="16" Lemmo="Cebyoc” postag="n-p---na-" relation="0BJ" /> 
«word form="Giptwv” head="16" id="17" lemma="èprtoc" postog="n-p---mg-" relation="ATR” /> 
«word form="éni" head="13" id="18" lemma-"imti" postag="c " relation="AuxC” /> 
«word form="oùx" head="20" id-"19" lemma="oò” postag="d * relation="AuxZ” /> 
«word form="éxo” head="22" id="20" lemma="Èxw" postag="vispia---" relation="ADV_CO” /> 
«word form="Gptouc” heod="20" id-"21" lemma-"üproc" postage"n-p---ma-" relation="0BJ" /> 
«word form="xai" heod="18" id="22" Lemmo="xai" postag="c " relation="COORD” /> 
«word form-"Orav" head="27" id="23" lenma="Gtav" postag="c * relation="AuxC” /> 
«word form="#X0n" head-"23" ide"24" lemmo-"Épxoyuou" postog-"v3sasa---" relation="ADV" /> 
«word form="f" head-"26" id-"25" lemma="ò" postag="1-s---fn-" relation-"ATR" /> 
«word form="nopria” head="24" id="26" lemma="mopgia" postag="n-s---fn-" relation-"SBJ" /> 
«word form="néupw" head="22" id-"27" lemmo="néunw”" postage"visfia---" relation-"ADV CO" /> 
«word form="ou" head="27" id="28" lemma-"gó" postag="p-s---mn-" relation="0B)" /> 
«word form="aùtà” head="27" id="29" lemmo="aùtéc" postag="p-p---na-" relation="0B)" /> 
«word forms"." head="@" id«"30" lemmo="punc1" postogs"u *" relations"AuxK" /» 
</sentence> 


Fig. 3: XML of the treebanked original layer of the sentence [1]. 


We can see that the annotation includes the existing form of the word in the sentence, 
its head, its lemma, the postag and the syntactic relation of the word in the sentence. 
The postag includes the whole morphological analysis: part-of-speech, person, num- 
ber, tense, mood, voice, gender, case and degree. For example, the form nepupag 
(word id 13) is a verb, singular, aorist participle active in the masculine nominative. 
The postag gives the very basic morphological analysis, and we could occasionally 
hope for something more specific, such as distinguishing proper nouns from common 
nouns or possessive pronouns from other pronouns, but as of this moment, Morpheus 
gives us these. In the future, other automatic parsers might take these distinctions 
into account more easily. However, even this morphological analysis enables us to 
search complex linguistic structures, especially when combined with the lemma and 
syntactic annotations. This, I think, is sufficient to fulfill the need of basic linguistic 
annotation for the Sematia corpus. Other levels of annotation, e.g. semantic or infor- 
mation structure annotations, would take considerably more time and effort. 
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4 Metadata and its purposes 


The date and place of origin of each text are vital when we wish to see in which time 
periods and in which areas certain linguistic features appear. They are generally pro- 
vided in the papyrus editions and presented also in the PN metadata field, from 
whence they are automatically drawn into Sematia. 

As mentioned already in VIERROS — HENRIKSSON 2017, we add some metadata, 
which is not available in PN, namely aspects relating to the handwriting and the writ- 
ers vs. authors. Some changes have been planned for these metadata fields and they 
will be implemented in the near future. The purpose is to identify text parts written in 
the same hand. When imported to Sematia, each document is divided into ‘acts of 
writing’ by the element <handShi ft>, i.e. each section written by a different writer 
receives its own layers and treebanks. Since there are often papyrus archives in which 
the same hand can have written several documents, it is important to link these acts 
of writing together, so that we can also try to study idiolects and compare certain 
writers to others. At the moment of writing, we can add metadata concerning the 
handwriting! and concerning the writer, the author and the addressee." See Fig. 4 
for an example on the metadata in O.Claud. II 245 (which only has one hand). In many 
cases, however, the name of the actual writer is not known, e.g. in private letters the 
sender of the letter is taken as the author, but the actual writer is not necessarily the 
same person as the author, nor is he named. In contracts, the names of the contract- 
ing parties are mentioned, but the scribe who draws up the text or who pens down 
the letters onto the papyrus often remains unnamed. Therefore, the Trismegistos Peo- 
ple ID cannot be used in identifying the hands, since we have so many hands without 
names to connect them with. Our intention is to give each hand an ID of its own. The 
hands that have been identified to come from one writer (sometimes a very difficult 
task), can be connected to the same ID. The hand-ID will make the current metadata 
field “Same hand” obsolete. 

For the purposes of studying linguistic register and features typical of certain text 
types, we have also included the fields in which we can insert metadata on the text 
type and the addressee. 


14 There are fields for the description of the handwriting in the edition or some other scholarly 
source, the description of the handwriting by the annotator, and the “Same hand” field, i.e. list of 
other documents, where the same hand is said to appear. These fields are text-based, and thus they 
do not provide good searchable data. Every papyrologist is also well aware of the lack of precision of 
these descriptions in different editions. 

15 For each person the annotator can add the name, title and the Trismegistos People ID (http:// 
www.trismegistos.org/ref/index.php). 

16 In its current state, the field is not very usable, user-friendly or accurate; the list of other docu- 
ments where the same hand appears is done in stable URLs of the documents in PN, but one document 
can contain several hands. 
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*etə papysunfo/acbcp/o. daud 2:243, retp./papyr to 
Ascbop/o ctaəq.2 245. rep /papyr 'o/adodo 
Jo.caud. 2:248, wap-/fpapys “vo/Gdbdo/o cuaq2 247, 
"eto /pagyn *Ayocuco/o Baus 2 248. "ttp //papyr. to 
!ddbop/o 30.2249, rep //papyr OIDA 

fo Bead 2250, wap //pspyn rvorododo/o CBAL2 251 
"eto pagya “vo/ocbco/o Saud 2252. rep /rpagyr fo 
PSO. ces 2 233. rep u'/pepyr- wo/ododo 

o. daqa 2254 


Fig. 4: Screenshot of the main view from Sematia, when the document O.Claud. Il 245 is expanded 
(but O.Claud. Il 243 and 246 are not). On the right, the metadata inserted in Sematia by the annota- 
tor is visible. The field “Same hand” is extensive with many documents also written in Petenepho- 
tes’s hand. The editor mentions that the writer is Petenephotes himself,” thus he is both the author 
and the writer. Clicking from the blue “original” or green “standard” buttons would take you to the 
text, and clicking the paper icon next to those buttons, you could view the treebank XML. 


5 Sample results, i.e. what queries can find 


The treebank XML files (including the metadata) in Sematia can be exported for que- 
rying in external treebank query tools.! It is possible to export the treebanks of all 
layers together, or choose the original or standard layers separately. I will not go 
through all the possibilities the external search engines can give for linguists;? I will 
describe some sample searches that can be performed on the Sematia site itself.” 
There, too, it is possible to search only from the treebanks of the original layers or 
only from the standard layers, but one of the essential features is the possibility to 
find instances where the original and standard layers differ. This is where we can get 


17 BÜLOW-JACOBSEN 1997, 69. 

18 E.g. SETS Treebank Search, PML Tree Query Engine or XQuery/BaseX, cf. VIERROS — HENRIKSSON 
2017, 13. 

19 Onethorough treebank-based study on ancient languages is KORKIAKANGAS 2016, in which the au- 
thor has been able to study under which conditions the Latin accusative began to be used as the sub- 
ject case in VIII and IX centuries. 

20 https://sematia.hum.helsinki.fi/tools. 
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more deeply into linguistic variation. For example, it is very simple to search for in- 
stances where one grammatical case is used when editors have thought that a differ- 
ent case would have been more understandable, or more standard (what the editorial 
standardizations might have meant in different times when papyri have been edited, 
see the chapter by J. Stolk in this volume). The search fields in Sematia employ Regu- 
lar Expressions (regex). The searches can naturally be limited in multiple ways, either 
by metadata fields or by the other field related to linguistic annotation, e.g. searching 
only objects or subjects, or only verbs or pronouns. More complex searches combining 
several words or forms would need to be made externally. 

An example search concerning the grammatical case, the dative instead of the gen- 
itive, is presented in Fig. 5. Since the postag holds the case in the 8" place of the string, 
we can use the values for dative (d) in the original layer’s postag field and genitive (g) 
in the 8" place in the standard layer’s field, and let other places of the string be whatever 
else by using the wildcard (.); the beginning of the string is marked by (^). The values 
(d) and (g) can have different meanings in other positions in the postag, thus it is good 
to define the exact location. In other words, when using the search, it is vital to know 
how the annotations have been made, i.e. what each symbol means e.g. in the postag 
field. The guidelines of annotation need to be known and understood. 


Da 


02212248 
0.cinud 2 248 
009212249 
0222 249 
octet 2208 


pad. Gt5 


pact. G15 


Showing 1 to 8 of 8 entries 


Fig. 5: A screenshot of the search and results in Sematia for the dative in the original layer vs. the 
genitive in the standard layer. 
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The search gives eight results with the limited data we have in Sematia at the moment 
(2017, ca. 100 papyri). The result list can be ordered according to different fields, in 
Fig. 5 itis ordered by the document name. We can see that some of the instances may, 
in fact, signal orthographic confusion based on phonological variation rather than 
case confusion (e.g. Nexoutwt / Nexoutov),” but some of the instances more clearly 
tell that the writer has, for some reason, really chosen the dative rather than the ex- 
pected genitive (e.g. Mapovati / Mapovatos). Similarly, we could bring up e.g. all 
prepositions in the texts by simple postag query (Ar), or see where singular verb forms 
appear instead of plural verb form (^v.s vs. ^v.p). In the latter search, the results 
again point to the interplay of phonological factors confusing the morphological in- 
terpretations. See Fig. 6, where two out of three of the singular vs. plural verb form 
are forms consisting of graphemes at / £, both marking the phoneme /e/ at this time, 
and the third one has a / € confusion, which was also perhaps due to weak pronunci- 
ation of the unstressed vowel. These results give us material for further research on 
phonology playing a part in the morphological mergers in Greek, and the impact of ed- 
ucation in writers' ability or inability to use standard orthography in such occasions, 
but they also provide us with material for enhancing our tools in the future. 


Showing 1 to 3 of 3 entries 


Fig. 6: A screenshot of the search in Sematia for a singular verb in the original layer vs. a plural verb 
in the standard layer. 


21 See, however, DAHLGREN 2017, 90 ff. on phonological variation of /o, u/ possibly playing a role in 
case variation. 
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6 Future plans 


A variation layer has been on our agenda since the beginning and it was discussed 
already in the previous article to some extent.2 With the above described method of 
comparison between the original and standard layers, we can only find variation 
(instances where there really are differences between the layers), when there is a 
regularization in the PN, or when the annotator has marked these differences in the 
treebank XML after seeing a difference not available in the PN version. These 
comparisons and differences are planned to be automatically retrieved into Sematia 
to form the basis for the variation layer. In addition, we do need a way to manually 
encode other types of linguistic variation in this layer for several reasons. For 
example, there is a need to further specify certain differences as more phonological 
or more morphological in nature. Secondly, some variation is impossible to detect 
from the annotations when the postag does not really describe what we have in the 
text. I will give an example of this type of case with one sentence from a letter written 
by Ammonius to Apollonius (O.Claud. I 155,3-5; II century AD): 


[2] Aprarjotoc ó kiBaptatns et|^pnké por STL EMLoTOAT ëa Ba amò Tfj; YUVALKOG pov. 
Harpaesius, the cibariator, has told me that I have got a letter from my wife. 


The form éAaBa, “I got", has not been corrected in the apparatus, even though it rep- 
resents mixed morphology; the aorist of the verb AapBavw would be £Aafov accord- 
ing to the classical standard (the second i.e. “strong”? aorist), but in the Koine the ath- 
ematic endings of the first i.e. *weak' aorist (-a for the first person) were occasionally 
used (and they are the ones used in modern Greek).? In the Mons Claudianus ostraka 
so far annotated in Sematia, there are nine attestations of the form £Aofia (plus three 
times written as aïñaßa),” but the editor has fluctuated in correcting it in the appa- 
ratus (see Fig. 7). We can find this word by using the word search, but as can be seen 
from the postag, it is not possible to indicate this type of variation there; the postag is 
the same in both šÀoBo and šÀoBov: first person singular aorist form. It would be very 
convenient to mark this up in the separate variation layer as mixed morphological 
endings in the aorist. 


22 VIERROS - HENRIKSSON 2017, 13. 
23 Cf. HORROCKS 2010, 109-10 and 143-4 on the developments of past-tense morphology. 
24 Allthree in O.Claud. II 236. 
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PRED_CO PRED_CO 


PRED PRED 


O8J CO O8J CO 


Fig. 7: A screenshot of the search results in Sematia for the word form ‘eapo’. In the “word” col- 
umn, the words in green come from the "original" layers and the words in black come from the 
“standard” layers. In O.Claud. volume I, the form was not standardised according to the classical 
norm, whereas in volume ll it was (with one exception). 


We will be developing Sematia and similar tools further.^ One idea is to have the 
whole papyrological corpus already present in Sematia, and updated in set intervals, 
ie. there would no longer be the need to import texts individually. Phonological 
searches will be enabled on the whole corpus. We also aim at developing an auto- 
matic morphological parser for Greek found in papyri, with more accurate analysis 
than what Morpheus currently has. 


25 The project “Digital Grammar of Greek Documentary Papyri" (ERC Starting Grant 2017 no. 758481) 
will use and develop these tools. 
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Joanne Vera Stolk 
Encoding Linguistic Variation in Greek 
Documentary Papyri 


The Past, Present and Future of Editorial Regularization 


1 Introduction 


Linguistic variation in documentary papyrihas been noticed by editors since the early 
days of Papyrology. Some editors make occasional comments about variant spell- 
ings!, others decide not to mention them at all. Kenyon explains his reasons for re- 
fraining from marking variation in the introduction to P.Lond. I: 


It is not to be supposed that any human transcript can be entirely free from errors; but the pal- 
pable blunders in spelling and grammar with which the papyri abound may be credited in the 
first instance to the original scribes. It has not been thought worth while to disfigure the pages 
by appending the warning sic to each such violation of conventional rules”. 


In BGU I (1892-1895), the first “truly papyrological edition" according to Van Minnen,’ 
the editors added to some transcribed words, such as BiBAEidtov, a note in the critical 
apparatus saying “l. BiBAiStov” (BGU I 2, n. to 1. 17).^ The method of the ‘Berlin editors’ 
is followed by Grenfell and Hunt in their editions published in P.Grenf. II (see p. xii) 
and P.Oxy. I. They also briefly explain where they consider such a note to be required: 


Faults of orthography are corrected in the critical notes wherever they seemed likely to cause 
any difficulty. 


My research was funded by The Research Council of Norway (NFR) and the Research Foundation — 
Flanders (FWO). 


1 E.g. Mahaffy in P.Petr. I 12 (1891), n. tol. 15 

2 P.Lond. I (1893), p. vi. 

3 VAN MINNEN 1993, 5-7. 

4 The addition of sic to unconventional language, as referred to in P.Lond. I (see quote above), is also 
found in the early BGU editions, next to the regularizations in the apparatus. For example, in BGU II 
451 we find tdxetov, l. Taxtov (1. 11), àortáceo001 with sic above e (1. 9) and åona|gópeðá og with sic 
above oe (ll. 11-12). This is a good example of the challenges faced during digitization of these older 
editions. All three were initially entered into the DDbDP as regularizations in the apparatus (l. taytov, 
l. àonácac8o and l. oot, respectively). The accusative case og, however, is normal for the addressee 
of the verb àorátopot and does not require regularization to a dative case, even though that seems to 
have been suggested by the sic in the ed.pr. 

5 P.Oxy. I (1898), p. xvi. 


8 Open Access. © 2018 Joanne Vera Stolk, published by De Gruyter. [CERANO] This work is licensed under the 
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. 
https://doi.org/10.1515/9783110547450-007 
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In 1931, this by then customary practice of regularization was included in the ‘Leiden 
conventions’ during the International Congress of Orientalists in Leiden (7-12 Sep- 
tember 1931). One would expect that the decision about a unified system of critical 
signs would be followed by a discussion on how to use them. Whereas several schol- 
ars have indeed commented upon the precise meaning and use of some of the signs, 
such as the underdot, little explanation has been provided about the practice to reg- 
ularize the Greek language in papyrus documents.° Herbert Youtie describes the pro- 
cess as follows: 


Immediately after the text the papyrologist puts a critical apparatus in which he gives conven- 
tional equivalents for vulgar or mistaken spellings.” 


This leaves the most important questions unaddressed, such as ‘to which forms 
should one apply this procedure?’ and ‘what is a conventional equivalent?’ 

Regularization implies a norm from which the attested variant deviates. This 
norm is generally not explicitly formulated in editions and rarely discussed in sec- 
ondary literature. This makes one wonder whether editors always use the same 
norms. Whereas the early papyrus editions had to cope with readers that were unfa- 
miliar with the Koine Greek language, advances in Greek linguistics and the large cor- 
pus of papyrus editions published to date have made most modern readers more ac- 
customed to the features of Koine Greek. May this have changed editorial practices? 
The digitization of papyrus editions in the Duke Databank of Documentary Papyri 
(DDbDP) required a level of standardization across all editions. How did the digitiza- 
tion process influence the consistency of traditional methods? These editorial prac- 
tices have not been studied before, while they form the basis for our modern tools and 
digital editions. In order to develop new tools and new methods for digital editing, I 
consider it important to examine how the current ones are functioning and how we 
can use existing methods to improve digital technology. 

In this paper I analyse the results of a system of editorial regularization which 
has been in practice for 125 years. The study of editorial practices in the past and pre- 
sent is executed by means of the new Trismegistos Text Irregularities tool. This tool 
collects all editorial interventions that are annotated in the Papyrological Navigator 
(http://www.papyri.info) and allows for detailed searches and analyses of the attes- 
tations.? I will first give a short overview of the parts of the Leiden conventions that 
are relevant for the regularization of language and their current application in the 
digital editions in the Papyrological Navigator (section 2). Then, I will discuss the past 


6 See some notes on the use of critical signs in HUNT 1932 and YOUTIE 1966. Usually, nothing more is 
said about the practice of regularization than "give the standard spelling in the apparatus", cf. SCHU- 
BERT 2009, 202. 

7 YOUTIE 1963, 22. 

8 For more information about this tool see DEPAUW — STOLK 2015. 
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and current use of critical signs and regularizations in the critical apparatus in the 
original and digital editions (section 3). The possibilities for categorization of varia- 
tion and different standards are examined in section 4, followed by a concluding sec- 
tion on how we may be able to combine the traditional and modern aims in the devel- 
opment of new digital tools (section 5). 


2 The ‘Leiden system’ 


At the 18" International Congress of Orientalists in Leiden (7-12 September 1931), the 
participants of the Papyrology section discussed the usage of critical signs in editions 
of inscriptions, papyri and literary authors. They decided on a unified set of conven- 
tions, later referred to as the ‘Leiden system’.’ As this was designed to be a universal 
system for editions of documentary and literary texts, it contained several elements 
which might seem redundant for editing documentary papyri. Two sets of brackets 
were chosen to represent scribal omissions and additions to the text, namely the an- 
gular brackets (...) for “lacunes” and “additions (lacunes comblées)” and the braces 
{...} for “interpolations”. Of course, interpolations that found their way into the origi- 
nal text through copied manuscripts are not commonly encountered in documentary 
material. Consequently, these two sets of brackets are in papyrological practice rein- 
terpreted to represent straightforward editorial ‘additions’ and ‘deletions’ of letters 
and words that were forgotten or added superfluously by the scribe of the document 
for various reasons. The remaining two categories of editorial intervention are “cor- 
ruptions” and “corrections”. Both are indicated in the critical apparatus of documen- 
tary texts and are not distinguished formally in papyrus editions. Van Groningen 
added explicitly that corrections should never replace the text of the papyrus in the 
transcription (as done with literary texts). 

The different types of editorial interventions are all represented in the EpiDoc 
schema used for marking up textual features in digital editions of inscriptions and 
papyri." Accordingly, the papyrological conventions used in the Duke Databank of 
Documentary Papyri include the angular brackets for “Characters erroneously omit- 
ted by the scribe, added by modern editor”, the braces for “Superfluous letters re- 
moved by the editor” as well as the option to put regularizations in the critical appa- 
ratus." The regularizations in the apparatus can be tagged in different ways in 
EpiDoc, namely as “Correction of erroneous characters” with the two alternatives 
marked by <corr> and <sic> and as “Regularization of dialect or late spellings, 


9 See Essai d’unification des méthodes employées dans les éditions de papyrus, CE 7 (1932), 285-7. 
10 VAN GRONINGEN 1932, 268. 

11 EpiDoc is a TEI-based XML encoding standard developed for digital editions, see BODARD 2010. 
12 http://papyri.info/conventions.html, accessed on 22 May 2017. 
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etc." marked by «orig» and «reg».? Both are used in the collaborative online ed- 
iting environment of the Papyrological Navigator, called the Papyrological Editor.” 
This platform uses a non-XML representation of the EpiDoc schema, called ‘Leiden+’, 
in order to facilitate easy entry of new texts by its users.” 

All editorial conventions used in Leiden- are explained to the user in a set of 
online guidelines. The Leiden Documentation tells the digital editor to distinguish 
between a “spelling correction" to be used for “correction of outright scribal error" Y 
and an “orthographic regularization" to be used for a *non-standard orthographic 
form”.! According to the guidelines, critical signs should be used for spelling correc- 
tions as well, which reduces the practical difference between the four categories into 
two basic types. The PN is thus expected to encode 
1. ‘corrections’ by means of critical signs (for additions and omissions) and in the 

apparatus (for substitutions and more complex cases), and 
2. ‘regularizations’ of non-standard forms in the apparatus. 


3 Editorial regularization in practice 


Although papyrologists have agreed on the methods to be used in papyrus editions, 
as described above, the application of these basic principles is not self-evident. Her- 
bert Youtie already stated in his prolegomena to the textual criticism of documentary 


papyri: 


it is a far cry from subjective opinion to objective reality, although no hint of this difficulty is ever 
betrayed in the definition of the signs that we find in papyrological manuals.” 


13 For more information about these two and other possible editorial interventions see http://www. 
stoa.org/epidoc/gl/latest/app-alltrans.html, accessed on 22 May 2017. 

14 http://papyri.info/editor. 

15 BAUMANN 2013, 102-4; SOSIN 2010. 

16 TheLeiden- guidelines (http://papyri.info/docs/leiden plus) have been subject to revision since 
the start of the editorial interface to the Papyrological Navigator. The unfortunate decision to display 
the corrected reading in the text and the original in the apparatus has been changed to the common 
practice in editions to show the original text in the transcription and regularizations in the apparatus. 
However, this technical change still has some consequences for the display of critical signs, line 
breaks and accents of regularized words that were entered before the change. Some attempts have 
been made to clarify the distinction between corrections and regularizations in the guidelines with 
varying results, cf. section 3. 

17 http://papyri.info/docs/leiden_plus#spelling-correction, accessed on 22 May 2017. 

18 http://papyri.info/docs/leiden_plus#orthographic-regularization, accessed on 22 May 2017 

19 YOUTIE 1974, 64. 
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While this may apply to all critical signs, it is especially true for the editorial regular- 
izations of the language found in papyrus documents. I will illustrate this by some 
examples mentioned below.” 

Following the basic distinctions available in EpiDoc (see section 2), I will distin- 
guish between the so-called ‘corrections’ indicated by means of critical signs and in 
the apparatus (section 3.1) and ‘regularizations’ in the apparatus (section 3.2). The 
starting point for this comparison is the database of TM Text Irregularities, which con- 
tains a collection of all editorial regularizations in papyrus editions in the PN.” There 
are two stages to take into account: the regularization indicated in the editio princeps 
and the annotation in the digital edition in the PN. This method will allow me only to 
quantify the outcomes of the second stage of this process. It should be noted that the 
digital edition in the PN is not always a true replica of the original edition, as more 
regularizations have been added in an attempt to level out the differences in conven- 
tions between various (older) editions. Hence, for every example mentioned below, I 
will also compare the digital regularization with the one in the original edition in or- 
der to reflect on possible differences between the two stages of editing. 


3.1 Corrections and critical signs 


The EpiDoc schema offers the possibility to distinguish between corrections of scribal 
errors and orthographic regularizations (see section 2). The application of a special 
‘correction’ tag results in the addition of (corr) after the corrected form in the ap- 
paratus of the digital edition. In practice, it has never been in frequent use and some 
earlier instances have been automatically converted into regularizations. The remain- 
ing 140 corrections might have slipped through the net at an earlier stage or may have 
been added later, as users are still confronted with guidelines mentioning this option.” 

A closer look at the instances that are encoded as correction at the moment re- 
veals that a significant part of them does not seem to fit the definition of “outright 
scribal error”. Regularizations of interchanges resulting from phonological mergers, 
such as ig to eic in O.Claud. IV 723 and IIapaóioov to Mapadeioovu, AgiBa to Mfa and 
[am]o8dow to [an]o5wow in SB XXVI 16796,10-11, 16, are regularly found among these 


20 Alleditorial mistakes and problematic instances marked out in this article can of course be revised 
through the Papyrological Editor, reducing the amount of variation slightly. These examples are, how- 
ever, understood to be representative for some more fundamental problems with the practice of lin- 
guistic regularization. These problems and their possible solutions will be discussed further in section 5. 
21 http://www.trismegistos.org/textirregularities, state of PN January 2014. Part of the search que- 
ries for this paper are made in the offline database, state May 2017. 

22 For some of these texts someone from the editorial board already suggested changing the correc- 
tion tags into regularization tags before finalization of the entry, see for example the editorial history 
of O.Did. 417 and P.Naqlun II 22, but these changes did not find their way into the online edition. 
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corrections (50 times).” Morphological regularizations are also common (41 times). 
For some of those, it is possible to see why the (digital) editor regarded them as scribal 
errors. For example, in BGU XVII 2682,6, 9-10, the scribe mechanically added the 
standard accusative object ywpiov dpmteAtkov, whereas in this particular construction 
([O]poAoy@ ... pepidav | piav ywpiov àpmeAikcóv) the noun phrase should have been a 
genitive partitive to the object pepidav | piav. In P.Gen. IV 192,10-11, the pronoun 
oot was inserted too early and ended up with the wrong verb: óuoÀoy@ éxetv oot xai | 
xpeworteiv instead of ópoAoyà £yev xai | ypewoteîv oot. Printed editions do not make 
a distinction between mechanical scribal errors and other regularizations, although 
they sometimes provide an explanation for the variation in the commentary (as was 
done for BGU XVII 2682, n. to 1. 10). Apart from those occasional comments, the inter- 
pretation of the distinction between regularization and scribal error depends largely 
on the person digitizing the edition. The phrase ov vavdarg xè é£kataotÃç was regu- 
larized as “l. vavAotc Kai Ekatootaic” in the apparatus of P.Jena II 8,7, but vavàdorg 
was entered into the PN as a correction, kai as regularization and éxatootijs as regu- 
larization (probably mistakenly for &katootaîg). Obviously, the distinction between 
the two types of regularizations creates a great challenge for the digital editor, espe- 
cially without a clear definition of 'scribal error' at hand. 

Besides the special correction tag, simple scribal errors can also be indicated with 
critical signs according to the guidelines (see section 2).” The angular brackets (for 
editorial additions) and braces (for editorial deletions) are in common use in both 
printed and digital editions. In TM Text Irregularities, we collected a total of 6,920 
instances of the use of angular brackets and 3,063 attestations of braces in the digital 
editions in the PN. Both ofthem are primarily used for scribal omissions and additions 
of whole words, amounting to 6696 and 80% of the instances of the angular brackets 
and braces respectively. This also forms the main distinction between the use of crit- 
ical signs in the text and regularizations in the apparatus: the critical signs mark ad- 
ditions and deletions of whole words, while regularizations are almost exclusively 
limited to parts of words.” However, the critical signs are also used for single letters 


23 Based on the collection in TM Text Irregularities I made a list of the ‘corrections’ that are more 
likely to be the result of phonological changes in the language (cf. 4.1), so that these could be con- 
verted into regularizations in PN. Josh Sosin replied to me that these corrections will be converted, 
but the option to distinguish between different types of errors is going to be maintained in the PE in 
the future (personal communication, 7 June 2017). 

24 See also VIERROS 2012; STOLK 2015, 268-71. 

25 http://papyri.info/docs/leiden plus£leiden-angle-brackets; 
http://papyri.info/docs/leiden_plus#leiden-braces; 
http://papyri.info/docs/leiden_plus#spelling-correction, accessed on 23 May 2017. 

26 Ifregularizations in the apparatus are used for the addition of several words, the angular brackets 
are sometimes added to the apparatus entry as well, e.g. xeipoypo«peica, l. xeipoypapiia &nAf| 
ypap)etoa in P.Oxy. XXXIV 2724,20. 
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and patts of words and in this usage they often overlap with the regularizations. 
Around 82% of the angular brackets and braces put around part of a word are in fact 
used to indicate interchanges at a phonological and/or morphological level, such as 
{e)t or {e} (160 times) and the addition and omission of final -ç (237 times) and -v (198 
times). If one would want to achieve a meaningful difference between the use of crit- 
ical signs and regularizations in the apparatus, critical signs in the text should not be 
used for orthographic and morphological interchanges affecting only a single letter 
or part of a word.” 


3.2 Regularizations in the apparatus 


Regularizations are traditionally indicated with ‘l.’ for lege “read” in the apparatus of 

an edition. They make up the majority of all instances of editorial linguistic interven- 

tion in papyrus documents (92 %), amounting to more than 120,000 instances in all 
digitized papyri. Most of the editorial regularizations concern orthographic varia- 
tion caused by changes in the pronunciation of Koine Greek (70%). Another signif- 

icant part of the regularizations affects the spelling and use of morphemes (26%), 

such as case and verb endings. I divide the variation at a morphological level into 

two types: 

1. morphological interchange between different declensions or conjugations, such 
as the variation between an accusative singular in -a and -av for consonants 
stems or between the sigmatic and root aorist inflection of certain verbs,” and 

2. morphosyntactic variation between the use of morphemes in a particular syntac- 
tic context, such as between a genitive or a dative case to express the recipient of 
a verb of giving or between an indicative or subjunctive following the conjunction 
(vo. 


Both types occur among the regularizations in the apparatus. In some cases, morpho- 
logical or morphosyntactic variation may be related to phonological merger as well. 
An example of this is the frequent interchange of o and o, of which one third of the 
instances are found in case endings (e.g. tov / Gv) and two thirds in other positions 
(e.g. wKTW / OKTW). It is, therefore, not always easy to distinguish different types of 
variation based on the level of language organization that they apply to. 

Almost 40% of the regularizations of orthographic variation concern the inter- 
change of t and et. For most of these variant spellings, regularization is not strictly 
necessary in order to understand the meaning of the word. Still, there are many forms 


27 Apart from the large group of common phonological and morphological irregularities, the remain- 
ing 20% may concern a relatively high portion of potential ‘scribal errors’. The problematic identifi- 
cation of these 'scribal errors' will be addressed in section 4.1. 

28 GIGNAC 1981, 45-6 and 290-7. 
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for which one spelling has been regularized consistently in (almost) all instances 
throughout the corpus, such as îoot to £ikoot, £ytc to Exetc, £AOtv to £AO&tv etc. This is 
partly due to the addition of regularizations during the digitization process. For exam- 
ple, in the ed.pr. of P.Oxy. XLIII 3117 interchanges between t and £t are only regularized 
when they could be confusing (e.g. ¿ni to l. émei in Il. 6 and 14), but many others have 
been added in the digital edition, such as to pifAeía in 1. 4, xotvwviv in 1. 5 and 
àroxpeítvaoOat in 1. 6. The few instances where regularization in the PN is lacking may 
be caused by human error, such as the typo ‘nåde for maAetv’ rather than náà in the 
digital edition of in P.Oxy. XLIII 3117,13-14 (http://papyri.info/ddbdp/p.oxy;43;3117), 
or the regularization of nepei to nepi in the ed.pr. of SB XX 14990,15, which seems to 
have been overlooked in the Sammelbuch and the digital edition. 

There are also words for which the standard spelling may be more difficult to es- 
tablish. According to classical rules, the suffix of the derived noun orepqueta “excel- 
lency” is spelled with sı.” This is also the spelling which is found in the majority of 
the VI- and VII-century papyri, such as the attestations in P.Oxy. 1135-138 published 
in 1898. The alternative spelling Urepqovta is not regularized in P.Oxy. I 144,4, nor in 
the five papyri P.Cair.Masp. I 67003, 67005-67008, published in 1911. Regularizations 
of brep~uta do occur in editions that were published later, such as P.Ross.Georg. V 
34,2 (published in 1935), CPR XXIV 27,17 (published in 2002), and P.Oxy. LXX 4790,16, 
19 and 30 (published in 2006). The alternative spelling in P.Oxy. I 144,4 became even- 
tually regularized in the online edition. P.Cair.Masp. I 67003, 67005-67008 remain 
without regularization in their online editions.” Remarkably, a regularization of the 
common form Urtepquto to Urtepqvia was also added to the digital edition of P.Lond. 
III 774-778.” Whereas the earlier editions seem rather modest with regularizations of 
words that can be perfectly understood without, the growing need for consistency 
may have extended regularization to be applied to all ‘non-standard’ forms without 
agreement on the definition of ‘non-standard’. 

The Leiden conventions were designed to do reduce variation in editorial prac- 
tices. The common format of a transcription with a critical apparatus containing regu- 
larizations becomes indeed the standard for all editions, but the variation in regulari- 
zation practices continues in printed editions after 1931. The word pipAuoquAóktov 
“archive” is spelled as such in 22 papyri and as BiBAlopuXdxketov in six papyri between 
the II and IV centuries AD. The spelling BiBAtopvAdketov is regularized to 
BiBALopuAdktov in the edition of P.Diog. 20, 6, and the online editions of SB VI 9625,23, 


29 HERRING 1989, 31-3. 

30 Cf. PALMER 1945, 54. 

31 Perhaps accidentally; or because the alternative spelling seems to have been the norm in the Di- 
oscorus archive. 

32 These documents originate from the Apion archive, just as most of the other documents with the 
word ùrepgveta, and they show the spelling that is normally found in this archive. It is, therefore, 
not clear what the regularization was based on. 
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PSI V 454,19, and P.Tebt. II 318,23; the normalized spelling is also found in the index 
of the last two editions. For the two editions that remain without regularization (P.Gen. 
P 144,23; P.Hamb. I 16,22), the spelling BiBAtomvAaxeiov was used both in the texts and 
indices of the original editions. Strikingly, the more common spelling BiBAtopvAdKtov 
was even regularized to BiBAtopuAdketov in the first editions of P.Oxy. XXXIII 2665,17 
and 19, P.Fam.Tebt. 15,iii,79, and P.Hamb. IV 244,12, and there are also editors that 
supplement the word in this spelling in abbreviations or lacunae (see BGU III, p. 2 to 
BGU I 243,15, taken over in Chr.M. 216; Chr.M. 217,9, and P.Fam.Tebt. 29,44). 

Inconsistent regularizations, such as the ones mentioned above, often require 
careful analysis to determine whether this apparent lack of consistency can be justi- 
fied in any way in each of the given situations and based on the material that the 
editors had at their disposal. Similarly, complicated situations arise when one at- 
tempts to regularize morphosyntactic variation. The phrase dv oov Tñ THX! 60£n “if 
it seems right to your fortune” occurs regularly in petitions from the II and III centu- 
ries AD. The second person singular pronoun is usually in the genitive case (oov), but 
it is also attested in the dative case (cot). The dative oot is regularized into a genitive 
gov in SB XXIV 15915,6, while SB XVIII 13732,13, regularizes the common genitive into 
the dative in this phrase.” Confusion about the use of the dative or genitive case in 
these types of constructions is common among both scribes and editors and regular- 
ization is often far from straightforward.” 

Inconsistent regularizations are usually caused by a lack of agreement about the 
method of standardization. Differences between older editions have not always been 
levelled out during the digitization process and they might even have gotten worse in 
some of the more complicated examples mentioned above. Some editions take a more 
extreme approach than others when it comes to choosing a method for regularization. 
Common itacistic spellings, such as &tva and ic, are often regularized in papyrus edi- 
tions, but notin the editions of the Mons Claudianus ostraka. This is probably because 
these particular interchanges are very common in these ostraka and regularization 
seems unnecessary.” This practice is not entirely consistent throughout the volumes 
(e.g. O.Claud. IV 723 and 839 regularize ic, but O.Claud. IV 724 and 840 do not). Regu- 
larizations have been added during the digitization process in accordance with other 
papyrus editions, but the end result is still far from uniform (e.g. iç has been regularized 
in the digital editions of O.Claud. II 248 and 276, but not in O.Claud. II 363 and 383). 

Comparison between texts in the same volume and among other parallel texts is 
a common practice, but it is not the main method of regularization in most papyrus 
editions. The word voookopeiov “hospital” is attested in full in 14 papyri dated to the 
VI and VII centuries. Only one of those attestations is spelled with £t (SB I 4668,4), as 


33 See STOLK 2017, 196-7 with n. 31. 
34 For more examples see STOLK 2015 and 2017. 
35 Compare the comment by Grenfell and Hunt in P.Oxy. I, cited above in section 1. 
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is also common in modern Greek, while all the others write voooxopiov.* Based on 
comparison to contemporary documents, no regularization seems required for the 
other instances. In reality, regularizations to the standard spelling vocogopetov are 
found in original editions (e.g. P.Bodl. I 47,12, 20 and 26; CPR XXII 2,1, 5 and 9) and 
digital editions (e.g. P.Amh. II 154,2 and 8; P.Lond. III 1324,7), while others remain 
without any form of regularization (e.g. P.Oxy. XVI 1898,19 and 38; P.Oxy. XIX 
2238,18)." This combination of different methods will inevitably lead to more incon- 
sistencies within and between printed and digital editions in the future. 


4 Standardization 


Past and current approaches have not resulted in a clear distinction between ‘scribal 
error’ and ‘non-standard variant’ in the PN (see 3.1). The question remains whether it 
is possible to distinguish scribal errors from other types of variation and whether we 
should want to make such a formal distinction in (digital) editions (4.1). 

It has been shown that regularization of variation due to phonological, morpho- 
logical and morphosyntactic changes is not always consistent (see 3.2). Editors may 
use different methods to identify the norm and, consequently, these norms may differ 
from each other. In section 4.2, I will discuss various possibilities for establishing a 
standard for comparison. 


4.1 Scribal errors 


The traditional aim of textual criticism is the “Herstellung eines dem Autograph (Orig- 
inal) móglichst nahekommenden Textes”. Any corruptions to the text are caused by 
“the inability of scribes to make an accurate copy of the text that lay before them”.” 
Hence, any form of scribal intervention can be regarded as a mistake.“ Similar phe- 
nomena, such as misreading of the exemplar, orthographic variations and accidental 
alterations, occur in duplicate papyri, but not all documentary papyri are the result 


36 The spelling vocokopiov is also common in Coptic, cf. FORSTER 2002, 549, and see e.g. CPR IV 
198,16 and 21. 

37 Supplements for abbreviations show the same variation. The spelling with t is supplemented in 
abbreviations in Stud.Pal. III 314,1, Stud.Pal. VIII 791,1, and 875,2, while et has even been supple- 
mented in papyri where the spelling with tis found elsewhere in the same text, see CPR XXII 2,1, 5, 9 
and 11; P.Oxy. LXI 4131,16 and 39. 

38 Maas 1950, 5. 

39 REYNOLDS — WILSON 1991, 222. 

40 Some examples of such (deliberate or accidental) mistakes are given in the list in REYNOLDS — WIL- 
SON 1991, 222-33. 
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of copying.“ Therefore, our definition of scribal error has to be different from the one 
used for copying literary texts. 

Papyrus documents are the product of their own time and not the result of several 
centuries of transmission. Therefore, changes in the language do not need be re- 
garded as scribal or copying errors in documents. Still, a division between scribal er- 
ror and linguistic variation is commonly applied in linguistic approaches. Variation 
in the written language can be used to reconstruct changes in the history of the spo- 
ken language. In order to do that, significant variations, i.e. interchanges reflecting 
the spoken language, have to be separated from “Verschreibungen””, “garbage er- 
rors"? or “manifest blunders”. Gignac identifies this difference between “phoneti- 
cally significant variation” and “sheer mistakes and slips of the pen” by the principles 
of frequency and regularity: 


If certain letters or groups of letters interchange only rarely and irregularly, there might be an- 
other explanation.‘ 


His other explanations include (a) anticipation and repetition, (b) inversion, (c) me- 
chanical reproduction, (d) analogical formation and (e) etymological analysis." 
These examples of variation which occur irregularly and do not seem to reflect the 
spoken language can be described as ‘scribal errors’. Scribal errors of this type can 
usually be explained by common cognitive processes.‘ 

Mechanical and cognitive processes may explain the appearance of scribal errors, 
but they do not constitute a comprehensive categorization or definition of the phe- 
nomenon itself. Haplography and dittography, for instance, are prime examples of 
the cognitive processes of anticipation and repetition (a). However, the simplification 
and gemination of consonants can also be explained by “the identification in speech 
of single and double consonants".*? Hence, the example of “outright scribal error, e.g. 
ot[platteds for otpatnyòg” given in the Leiden documentation” can also be ex- 
plained by hypercorrective gemination of the consonant, the phonetic similarity of £ 
and n and the omission of y in the pronunciation as glide.? Even the loss of a full 


41 Foratypology of scribal errors in duplicate papyri see YUEN-COLLINGRIDGE — CHOAT 2010. 
42 KAPSOMENAKIS 1938, 4. 

43 LASS 1997, 62. 

44 JANNARIS 1907, 68. 

45 GIGNAC 1976, 57 and 59. 

46 GIGNAC 1976, 59. 

47 Cf. KAPSOMENAKIS 1938, 4. 

48 GIGNAC 1976, 154-5. 

49 http://papyri.info/docs/leiden_plus#spelling-correction, accessed 22 May 2017. A better example 
is <:tipnvilcorr|tpynyv:>. 

50 GIGNAC 1976, 242-7 and 71-5. 
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syllable may sometimes have a phonetic explanation.” Inversion (b) is another prob- 
lematic category. Although the transposition of two letters may result “in spellings 
like atmoshpere which do not reflect an actual spoken form”, metathesis of a vowel 
and resonant, especially p, is relatively frequent in the papyri and may have had a 
parallel in speech.? Mechanical reproduction (c) seems to identify a type of variation 
that is indeed limited to the written language, but the two remaining categories on 
Gignac’s list are not scribal errors strictly speaking either. Analogical formation (d) 
may not be caused by phonological changes, but can be indicative of morphological 
change in the spoken language, as is also acknowledged by Gignac. When a form 
can be explained by changes in the spoken language (phonological or morphologi- 
cal), it should not be classified as a scribal error according to the definitions men- 
tioned above. Etymological analysis (e), such as the spelling of ék- in compounds be- 
fore a voiced consonant, may not be relevant for the actual pronunciation of the word 
in later periods, but this change in orthographic conventions is better classified as 
orthographic variation than as a mechanical scribal error. 

Mechanical scribal errors in papyrus documents have received little study in their 
own right. Negative definitions prevail in the secondary literature aiming at the re- 
construction of the original text or the spoken language. Gignac gives an excellent 
introduction to his method, but his overview of orthographic variations that are not 
phonetically significant cannot be used as a typology of scribal errors in documentary 
papyri.” Editors should feel free to discuss causes for variation in their commentaries 
and digital editors might want to continue experimenting with these distinctions, but 
it would be better to treat possible scribal errors in the same way as other types of 
variation in order to secure stable future reference to all variant forms. 


4.2 Different standards 


Regularization implies the use of a standard. Every editor who regularizes the lan- 

guage found on a papyrus compares the attested words and constructions with a cer- 

tain norm. How and why this norm is chosen is usually not stated explicitly, but can 

be inferred to a certain extent from the patterns of regularization observed above (see 

3.2). There seem to be two main sources for comparison: 

1. external sources, such as rules described in dictionaries, grammars and text 
books, and 


51 Cf. GIGNAC 1976, 312-3. 

52 GIGNAC 1976, 59 and cf. pp. 314-5. 
53 GIGNAC 1976, 59. 

54 GIGNAC 1976, 57—60. 
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2. internal sources, such as other instances in the text itself or parallel texts that are 
ideally closely related in contents and context. 


Regularization to vocokopetov, for example, was probably based on external criteria 
in most instances, since this spelling is rarely found in contemporary papyri. The 
spelling of ic, on the other hand, may have been left without regularization in the 
ostraka from Mons Claudianus based on comparison to other ostraka from the same 
area. As long as the attestations found in close parallels corroborate the external 
standards, editorial regularization of variant spellings tends to be consistent. As soon 
as both variants seem to be in regular use in contemporary papyri, such as with 
BiBAioquAdk(e)tov and Umep~pu(e)ta, different editorial principles and methods may 
lead to conflicting results. 

It is not true that classical norms were especially used in the early days of papy- 
rology and comparison with contemporary documents is an entirely new phenome- 
non. Variation in regularization practices is particularly common in early papyrus 
editions and classical norms are not consistently applied at all (cf. 3.2). Recent studies 
of the language of the papyri, often from a variationist perspective, have raised 
awareness of the possibility that scribal variation could be explained by its context.” 
This may have led some editors to consider more context-sensitive methods, but also 
more practical considerations may have prevented editors from regularizing spellings 
that occur very frequently in a specific group of documents. The variationist idea that 
linguistic variation is dependent on its context is not an entirely new concept to pap- 
yrologists. The principle of comparison with parallel texts for understanding and sup- 
plementing another papyrus has been in use for a long time. In order to interpret the 
language used in papyri, Youtie suggests the use of dictionaries, grammars and *an 
unremitting search for parallels”.** He further notes that 


U. Wilcken has somewhere characterized papyrology as a “Parallelenjagd”. No term could be 
more apt. A good share of the papyrologist's working time is devoted to searching for parallels.” 


Parallel examples are essential for a papyrologist to get familiar with the language 
and contents of different types of documents, to date the text and to identify the 
standard clauses used at different times and places.” Even though this method has 
been used for many years to interpret new texts and to supplement words and phrases 


55 See the papers in EVANS — OBBINK 2010; LEIWO — HALLA-AHO — VIERROS 2012; CROMWELL — GROSSMAN 
forthcoming. 

56 YOUTIE 1974, 33-7. 

57 YOUTIE 1974, 42 n. 39. 

58 Cf. TURNER 1980, 59-61. The ‘hunt for parallels’ is one of the main incentives for the digitization of 
papyrus editions, because it makes it easier for papyrologists to search for parallels in a large corpus 
of published papyri. 
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in fragmentarily preserved papyri, it is not always deemed suitable as a standard for 
linguistic comparison. Classical orthography and morphology are often understood 
to be the only proper standard for the language used in regularizations, in supple- 
ments of abbreviations and in lacunae.? Kapsomenakis already voiced his concerns 
about the artificial norms that tended to be applied to the Greek language in papyri: 


Übrigens hat eine volksmafig frei entwickelte Sprache ihre eigenen Gesetze, denen sie folgen 
muß, wenn sie ihre Aufgabe, der praktischen Verständigung zu dienen, erfüllen will. Die Vulga- 
rismen dürfen also diesen Gesetzen nicht widersprechen. Weiter hat die Verkennung der Rechte 
der Volkssprache dazu geführt, daf man viele Schreiberfehler entdeckte, die man nach der Me- 
thode beseitigen zu müssen glaubte.© 


Classical Attic norms continued to be used as the standard for orthography and mor- 
phology in post-classical periods, but it seems difficult to justify applying anachro- 
nistic norms in cases in which a variant form is frequently or even normally used in 
Koine Greek. Lack of the awareness of the norms for the language used in papyri can 
easily lead to misplaced regularizations, reconstructions and even readings.* The dis- 
crepancy between classical Attic and contemporary usage as the norm for editorial 
regularization is probably caused by a general lack of information about contempo- 
rary norms, as has also been pointed out by Youtie: 


But it is perhaps lack of linguistic information which trips us most often. Sometimes this takes 
the form of insufficient regard for the general laws of Hellenistic Greek, sometimes it is simply 
failure to search out the similar passages which are available in other papyrus texts. Whatever 
its cause, it has a crippling action capable of twisting our texts into fantastic shapes.9? 


Knowledge about Koine Greek in general and the linguistic norms applied in papyri 
in particular are essential ingredients for a good papyrus edition and may help to pre- 
vent many reading errors and problematic restorations. On the other hand, the stand- 
ards for orthography, morphology and morphosyntax in Koine Greek have still re- 
ceived little attention in research to date and there is no reference work that editors 
can use to identify a standard for every word or construction. These norms can, there- 
fore, only be identified by manual comparison among a selection of documents. This 
creates the typical gap between the use of external sources based on classical Greek 
and the contemporary internal evidence. 


59 Linguistic inconsistencies in the practices of restoration of the text in lacunae are clearly pointed 
out in EVANS forthcoming. I thank Trevor Evans for kindly sharing this unpublished paper with me 
and for sharing his thoughts about these issues. 

60 KAPSOMENAKIS 1938, 4. 

61 The problematic consequences of the practice to restore (and even read) classical Greek forms 
where they have not been written originally are illustrated in CLARYSSE 2008 and YOUTIE 1974, 8-10 
and 13-16. 

62 YOUTIE 1974, 13. 
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Koine Greek has never become a general standard for editorial regularization, alt- 
hough there are some exceptions. An example of such a well-known orthographic 
norm in Koine Greek is the spelling of the verbs yi(y)vopat ‘to be, to become’ and 
ylly)vwoxw ‘to know’. Mayser and Schmoll state that the spellings yivopai and 
ytvwokw are used without exception in the Ptolemaic papyri and Gignac confirms 
that these are also the normal spellings in the Roman period.# Accordingly, the 
spelling yivopat is usually not regularized and the Koine Greek spelling is used in 
most supplements of abbreviations of the verb.“ Still, regularization to yiyvopat is 
found in about a dozen editions (e.g. P.Bodl. I 17,i,9; P.Haun. II 22,5; P.Oxy. LXIV 
4441,x,27; P.Petra I 4,5) and has occasionally been added to digital editions as well 
(e.g. O.Claud. IV 798,6; P.Stras. VIII 772,6, 9, 15 and 21). In contrast to the relatively 
limited number of regularizations of yivopat, the verb yivwokw has been regularized 
to yiyvwokw in more than a hundred instances. Most of these regularizations, how- 
ever, concern verbs with other spelling irregularities (almost 90%), such as yetvwoxktv 
to ytyvwoxetv (e.g. P.Col. X 278,4; SB XXIV 16290,2 and 16291,4).° When regularizing 
these other aspects, the idea of the classical standard seems to have overruled Koine 
Greek spelling conventions. The verb yivopat is also frequently spelled as yeivopat, 
but this rarely provoked regularization to the classical spelling of the consonants. The 
fact that the spelling of the verb yivopat often serves as the prime example of language 
change in Koine Greek, may have convinced editors to take the Koine Greek spelling 
as the standard for this verb more often.“ The differences in regularization between 
these two comparable verbs clearly illustrate the competing principles of regulariza- 
tion. 


5 Towards a new approach 


In the previous sections, I have illustrated the various practices and principles for 
editorial regularization as they have been used up till today. Editorial regularizations 


63 MAYSER — SCHMOLL 1970, 15 and 156; GIGNAC 1976, 176; see also LSJ s.v. 

64 Supplements of abbreviations and lacunae are other sources for editorial disagreement on lin- 
guistic variation. Different principles, such as regularization to classical orthography and comparison 
within the document or to other contemporary documents, are used by different editors. Since there 
is no current method to search for attestations in the real text only, search results are often biased for 
standard forms found in supplements and the apparatus. 

65 Paul Schubert regularized yetvwoxtv to yivwoxetv in the ed.pr. of SB XXIV 16290 and did not put 
a regularization to yewwoxi[v in the ed.pr. of SB XIV 16291, see SCHUBERT 1997, 193-4. Clearly, the 
need for standardization of regularization practices is not only felt during the digitization process, 
but also in large collections of papyrus editions such as the Sammelbuch. 

66 The sic of the editors behind the unusual spelling and morphology tà ytyvwpevot in O.Edfou II 
318,7, was even regularized to the Koine Greek spelling l. yivòpeva in the digital edition. 
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in the apparatus are used to indicate phonological, morphological and morphosyn- 
tactic variation (3.2), while critical signs, such as angular brackets and braces, are 
mainly used by the editors to mark the addition and omission of one or more words 
(3.1). When brackets and braces are applied to single letters or parts of words, their 
function largely overlaps with the regularizations in the apparatus. More study is 
needed to separate accidental scribal errors from other types of variation in the papyri 
(4.1). The same applies to establishing contemporary standards for Koine Greek (4.2). 
Both goals are worthwhile pursuing in separate studies in order to gain a better un- 
derstanding of the use of language in papyri, but such a distinction between different 
types of variation or different standards might not be essential for establishing a more 
consistent practice of encoding linguistic variation. 

The question comes down to what we would like to achieve with editorial regu- 
larization in papyrus editions. Are we trying to correct accidental scribal mistakes in 
the way the scribe would have wanted to? Are we normalizing the language to con- 
servative or contemporary standards? Or are we just helping the classically schooled 
modern reader to understand a text written in a different variety of Greek? This last 
idea was probably an important reason to start providing standard Attic equivalents 
in the apparatus, as Turner explains: 


The critical apparatus [...] can also usefully show how the editor understands his text. The word 
‘read’ or symbol I. = ‘lege’ need not mean that the Greek is incorrect: it is a sign of how it can be 
interpreted in terms of standard Attic Greek. 


The fact that Turner has to explain what is not meant by this sign immediately points 
out that the use of the word “lege” can be misleading. The command “read” is easily 
interpreted as a correction rather than an equivalent. This inherent ambiguity is 
worth noting here. Other, more appropriate, signs should be considered for future 
printed editions. For digital purposes, however, it would be better to take a different 
approach altogether. As the apparatus shows ‘how the editor understands his text’, 
the ideas about what should be explained in the apparatus and what not can differ 
significantly from one editor to the other. Some editors may follow this practice very 
strictly and always provide standard equivalents, whereas others may think that this 
is only necessary for forms that are less common and more difficult to understand for 
the reader of the edition, such as in the earlier editions of the Oxyrhynchus papyri. 
This results in a pragmatic and fluid norm for encoding variation. 

Fluid norms are not ideal in a digital environment. That is why it was attempted 
to make regularization more consistent in the DDbDP and in the Papyrological Navi- 
gator. Modern editors and the methods designed for the Papyrological Editor have 
succeeded in standardization of editorial practices in digital editions to a large extent, 
but consistent regularization is not always a straightforward procedure, as I have 


67 TURNER 1980, 71. 
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shown in the sections 3.2 and 4.2. Variation may be governed by various factors and 
this means that the chosen method for regularization may sometimes determine the 
outcome. This causes problems, because there are no guidelines describing a partic- 
ular methodolosy for regularization in papyrus documents. 

Digital technology, however, can do more than standardizing the practices of 
printed editions. I can identify three main aims for encoding linguistic variation in 
papyrus editions: 

1. to show readers of the edition how the editor interprets uncommon forms, 

2. to help papyrologists to search for parallels of words and phrases in various spell- 
ings, and 

3. to provide useful data for linguists studying the Koine Greek language. 


The current practices in editorial regularization and the search interface of the PN do 
not fulfil each of those aims equally well.® The traditional method of regularization 
is not suitable to encode variation consistently and objectively. In order to achieve 
objectivity we need to apply the same treatment to all forms rather than to rely on the 
judgements of individual editors to identify which forms are ‘uncommon’ enough. 
One could do this by providing a reference to a headword, i.e. a lemma, to every word 
that is attested on a papyrus. It is already possible in EpiDoc to annotate a ‘lemma’ 
attribute to every linguistic ‘token’. A hyperlink to a lemma can be very helpful for 
less experienced users and additional morphological annotation would give an op- 
portunity to the editor to explain how every form should be interpreted. The encoding 
of lexical and/or morphological information for every word can be similar to the cre- 
ation of onomastic and prosopographical references to every proper name.” The 
lemma could be in classical orthography, as it is not meant as a correction or regular- 
ization, but only as a reference point for all variant spellings. A search query would 
yield all attested variants of the lexeme or morpheme in question. Such an overview 
of attested variants and their chronological and geographical contexts will show 
which form might have been in common use at any given time. Full text search que- 
ries should ideally separate between the results based on real attestations on a papy- 
rus and results including supplements of abbreviations and restorations in lacunae 


68 Current search results in Papyrological Navigator do not give the number of attestations, but only 
the number of texts in which one or more attestations can be found. Furthermore, the Papyrological 
Navigator does not allow searches in the main text only; comments and regularizations in the appa- 
ratus are always included among the search results. Hence, the number of found attestations of stand- 
ard forms is biased due to the high number of regularizations in the apparatus, as supplemented ab- 
breviations and as restorations in lacunae. Real attestations can only be distinguished from examples 
in lacunae and in the apparatus by going through the search results manually. This especially affects 
the practicalities of the second and third point of the aims mentioned above. 

69 See BODARD 2010. 

70 Cf. Trismegistos People, http://www.trismegistos.org/ref/index.php, and BROUX — DEPAUW 2015. 
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added by editors. This will provide the papyrologist with a more realistic picture of 
the language used in papyri and this will benefit new editions in the future. 

Marking linguistic variation is not a bad idea in itself, nor is the attempt to stand- 
ardize editorial practices in digital editions. However, in order to reach the full poten- 
tial of these approaches, they need to be applied more rigorously and more objec- 
tively. Editorial regularizations that have been annotated up to now should not be 
discarded, but can be used to establish automatic recognition of the lemmata and 
their potential variants. Once such a digital tool is functioning properly, only the most 
uncommon forms would still need to be annotated manually, comparable to the orig- 
inal practice of regularization. There are different technological solutions and several 
possible platforms that would be suitable to achieve these aims. Until that moment, 
papyrologists and linguists will be able to explore the rich source of linguistic varia- 
tion available in Trismegistos Text Irregularities, a collection of a long history of edi- 
torial regularization. 
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1 Introduction 


There currently exist two main digital repositories for papyri: the well-known one of 
the Integrating Digital Papyrology (IDP) Project (http://papyri.info) and the most re- 
cent one of the project Digital Corpus of Literary Papyrology (DCLP) (http://www.lit- 
pap.info). Both corpora, which can be downloaded from GitHub (https://github.com/ 
papyri and https://github.com/DCLP, respectively), contain papyrological texts en- 
coded following the de facto standard EpiDoc schema, a subset of the TEI schema 
specifically designed for encoding ancient documents!. 

The notorious complexity of papyrological texts is also reflected in their digital 
encoding, which challenges the digital humanist in many respects. The major prob- 
lem is represented by the fragmentary nature of most papyri?. Papyrologists work to 
integrate texts to the best of their knowledge: sometimes just a few characters are 
missing, while other times full words need to be supplied, for a text to become mean- 
ingful (and even integrating single words is often not enough to understand a text). 
The degree of certainty for a given integration, then, varies depending on a plethora 
of factors, including especially the papyrologist’s expertise. 

The challenge posed by text integration deeply affects the TEI/EpiDoc XML en- 
coding of texts, where specific elements, such as the supplied and unclear ones, 
are used to mark up additions. In both corpora, the markup is added inline (instead 
of standoff)’. As a consequence, since fragmentation in papyri often occurs at the 
word level, the XML markup can break up words: this phenomenon, as is known, 
raises computational issues when it comes to tokenizing while attempting to preserve 
a link to the information contained in the markup of the original document. 

In this paper, I document a first attempt to add morphological annotation and 
lemmatization automatically to all Papyri.info documents. I restrict my present con- 
tribution to this repository because this is the only one containing some texts which 


1 Cf. REGGIANI 2017, 222 ff. On the DCLP see also the chapter by R. Ast and H. Essler in the present 
volume. 

2 See N. Reggiani in this volume, 8 4.4. 

3 See, for example, BANSKI 2010. For recent XPointer-related work see https://github.com/hcay- 
less/tei-xpointer.js (H. Cayless) and https://raffazizzi.github.io/ coreBuilder/ (R. Vigilanti) 


8 Open Access. © 2018 Giuseppe G. A. Celano, published by De Gruyter. [CORANA] This work is licensed under 
the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. 
https://doi.org/10.1515/9783110547450-008 
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have also been manually annotated in the Sematia treebank^. The Sematia texts pro- 
vide us with a gold standard which can be used to measure how well the MATE tagger’ 
(trained on literary texts) and a rule-based lemmatizer are expected to fare with re- 
spect to my automatically annotated corpus, which will henceforth be referred to as 
the MALP corpus (= M(orphologically) A(nnotated) (and) L(emmatized) P(apyri) cor- 
pus), available at https://github.com/gcelano/MALP. The MALP texts preserve the 
URNs of the original documents and the line break reference system for each token. 
Because of the complexity of the TEI inline markup in the Papyri.info files, I have not 
attempted to create stand-off annotations’. 

It goes without saying that having the possibility to search the corpora using mor- 
phological features and lemmas will positively impact our knowledge of the texts. 
This holds particularly true for a corpus containing a morphologically rich language 
such as Ancient Greek, which cannot be easily queried using simple graphic words. 
Moreover, morphological annotation and lemmatization dramatically speed up the 
treebanking process, the annotators taking advantage of some annotation being al- 
ready present and correct to a large extent. 

Since the Papyri.info corpus currently amounts to 62,901 documents, the only do- 
able way to add morphological annotation and lemmatization to all tokens is to rely 
on an automatic annotator. Currently the best performing POS tagger available for 
Ancient Greek is the MATE tagger, which has been trained on the literary texts of the 
Ancient Greek Dependency Treebank (AGDT)’. The accuracy measured is 8896*. Com- 
parable accuracies have been reported for the recent UDPipe tagger’, which however 
adopts an annotation schema different from that of the AGDT and the Sematia tree- 
bank. Lemmatization will be performed using a rule-based approach relying on the 
Morpheus morphological analyzer/dictionary”, from which lemmas can be extracted 
by linking it to the papyrological texts via word forms + fine-grained POS tags. 

In Section 2, I will present the Sematia treebank, which contains morphosyntactic 
annotation for some of the papyri contained in the Papyri.info corpus: they will serve 
as a gold standard to evaluate the accuracies of POS tagging and lemmatization. In 


4 Cf. VIERROS — HENRIKSSON 2017, and M. Vierros in this volume. I am aware of the existence of mor- 
phosyntactic annotations for the Herculaneum papyri (Philodemus Project at Wiirzburg), but they are 
not open annotations, and therefore cannot be reused. 

5 BOHNET — NIVRE 2012. See also http://www.ims.uni-stuttgart.de/forschung/ressourcen/werkzeuge/ 
matetools.en.html. 

6 See BANSKI 2010 for the issues related to TEI stand-off annotation. 

7 BAMMAN - CRANE 2011. Available at http://perseusdl.github.io/treebank_data. 

8 CELANO — CRANE — MAJIDI 2016. 

9 http://ufal.mff.cuni.cz/udpipe/users-manual. 

10 https://github.com/gcelano/LemmatizedAncientGreekXML/tree/master/Morpheus. See also 
CRANE 1991 
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Section 3, I detail the sentence-split and tokenization of the papyrological texts. Sec- 
tion 4 shows how the morphological annotation has been added to each token, while 
Section 5 details the lemmatization process. Section 6 contains conclusive remarks. 


2 Agold standard for linguistic annotation of 
papyri: the Sematia treebank 


The Sematia treebank (https://sematia.hum.helsinki.fi/user)" provides us with some 
semi-automatically annotated papyrological texts, which can be used to evaluate the 
accuracies of the POS tagging and the lemmatization of the MALP corpus. The tree- 
bank currently contains 224 papyrological texts annotated for their morphology 
(semi-automatically) and syntax (manually) according to the Guidelines for the Anno- 
tation of the Ancient Greek Dependency Treebank 2.0”, which were first designed for 
the annotation of the AGDT. The original texts come from the Papyri.info corpus, but 
no criterion for the choice of the kind of text is at the moment followed in the tree- 
bank. 

The morphological annotation and lemmatization process are performed using 
the Morpheus morphological analyzer, which was also used to help the manual an- 
notation of the AGDT texts. Since the MATE tagger has been trained on the texts of 
the AGDT, both the Sematia texts and the output of the MATE tagger are directly com- 
parable without conversion. 

The Sematia treebank contains two layers of annotation for each papyrological 
text: the “original” layer, which only consists in the linguistic material that has been 
preserved in a papyrus, and the “standard” layer, which is built on the “original” 
layer and integrates it with editorial work aimed to make the text intelligible. Our au- 
tomatic annotation is evaluated only against the texts of the standard layer: they pro- 
vide a much better input for the POS tagger, the text having been integrated with 
missing characters/words. 

Moreover, while the Sematia treebank contains different annotation files for each 
hand recognized within a papyrological text, I provide only one tokenization per text 
in the MALP corpus, considering the contributions of different hands as part of the 
same text, ona pair of any other integration. These texts were not used for comparison 
purposes to simplify the evaluation process (more precisely, 17 standard layer Se- 
matia texts have been excluded from the evaluation phase, i.e. the ones with the fol- 
lowing URNs (contained in <title/>): o.claud.1.139, o.claud.1.148, o.claud.2.227, 
upz.1.7, p.grenf.2.15, upz.1.59, and bgu.3.994. Moreover, the Sematia texts with URN 


11 See the chapter by M. Vierros in the present volume. 
12 CELANO 2014. 
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cpr.30.28, o.claud.1.131, and o.claud.1.135 are also excluded in that the Papyri.info 
counterpart of cpr.30.28 does not contain terminal punctuation marks allowing (au- 
tomatic) sentence detection, while o.claud.1.131and o.claud.1.135 contain Latin texts. 

The number of Sematia texts which can be used for comparison with the ones 
automatically generated for the MALP corpus is therefore reduced to 92 (i.e., 112 
standard layers texts - 17 — 3): they are henceforth referred to as the Sematia compar- 
ison corpus, which is compared with the MALP comparison corpus, i.e. a subset of 
the MALP corpus consisting in the exact same texts but sentence-splitted, tokenized, 
POS tagged, and lemmatized by using the algorithms developed for this study (avail- 
able at https://github.com/gcelano/MALP). 

The Sematia texts have been sentence-splitted and tokenized using the Arethusa 
annotation tool. Punctuation in Sematia texts has been partly edited manually, so the 
number of sentences is necessarily different from that one gets by simply sentence- 
splitting the original texts found in the Papyri.info corpus. More precisely, the Sematia 
comparison corpus contains 486 sentences, while the MALP comparison corpus 461 
sentences. 

The intersection between these two sets of sentences is represented by 204 sen- 
tences which are exactly the same, i.e. contain the exact same tokens (with the exclu- 
sion of the elliptical tokens manually added by annotators in the Sematia treebank, 
which are simply ignored). I will use these 204 sentences to evaluate the accuracy of 
POS tagging and lemmatization for the MALP corpus. Even if the comparison corpus 
is very small, it will provide us with a rough preliminary estimate of the quality of the 
POS tagging and lemmatization for the MALP corpus, which can be easily calculated 
automatically. 


3 Sentence split and tokenization 


Sentence split has been achieved using a rule-based algorithm identifying sentences on 
the basis of presence of the following terminal punctuation marks: the period, the semi- 
colon (which, in Ancient Greek, has the value of a question mark), and the dot above the 
line (whose function is comparable to that of the colon and semicolon in English). Im- 
portantly, some ancient punctuation marks are encoded in the text as XML elements, such 
as<g type="mid-punct"/>.Ihave converted all<g type="mid-punct"/> into 
dots above the line, which serve here just as generic terminal punctuation marks. 

In the Papyri.info corpus there is a huge variety of g elements, which are used to 
encode glyphs”. Their meaning is not always clear with respect to sentence-splitting, 
and so they are ignored in the present study. A refinement of sentence-splitting will 


13 See REGGIANI 2017, 252 and N. Reggiani in this volume, 8 4.7. 
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be possible by modifying the XQuery module at the relevant points provided at 
https://github.com/gcelano/MALP. 

The XQuery algorithm needs to also be improved to correctly distinguish periods 
used as final punctuation marks and periods used as abbreviation punctuation 
marks: e.g., a single letter followed by a period is taken by rule as an abbreviation 
(and so the period is not tokenized). Papyrological texts can however contain a con- 
siderable amount of Ancient Greek numbers (expressed via single alphabetic charac- 
ters), which are mis-tokenized if they are followed by a period at the end ofa sentence. 

More in general, punctuation in papyri is a very complex matter, which should def- 
initely receive much more attention than the one so farreceived both in the TEI/EPIDOC 
XML text encoding process and in the present study, where only ‘standard’ punctuation 
marks already present in the edited Papyri.info files have been taken into consideration, 
without any attempt of developing a more complex system for sentence detection. This 
has as a consequence that sentence-splitting in the Sematia treebank, which has been 
manually checked, is arguably more accurate (even if not uncontroversial). 

Tokenization has also been performed using a rule-based approach. Ancient Greek 
graphic words (i.e. space-separated words) are commonly treated as separate tokens. 
There are only a few arguably rare exceptions to this statement: e.g., crasis can be re- 
sponsible for the phonetic and graphic merging of an article and a noun, as in OnpEepa 
(= Tf ñušpo). The algorithm does not attempt to provide a solution for that (and the POS 
tagger itself has been trained on data where other-than-space-based tokenization is not 
always consistent). 

Since the TEI/EpiDoc XML files contain inline markup, one major tokenization-re- 
lated task is to distinguish the ‘data text’ from the ‘metadata text’. For example, the TEI 
note element contains some additional information about a part of the text, as shown in 
the file o.amst.8.xml (within the Papyri.info repository), where, with respect to a few char- 
acters, the <note> element specifies “Writing perpendicular to main text”. The text node 
of the note elements should clearly not be tokenized, not belonging to the main text. 

A few TEI elements can contain textual content which has been added by an edi- 
tor and can be considered as part of the text: e.g., the <supplied> element allows 
integration of missing characters. In jur.pap.36.xml, for example, the textual evi- 
dence én. xoAoupévn is directly followed by the element «supplied reason= 
"lost">c</supplied> to signal that a final sigma got lost but is necessary to 
properly understand the text. A more complex case is represented by the element 
<choice/>, which is designed to allow different readings in a text": for example, 
<choice><reg>1fic</reg><orig>1f©v</orig></choice> on line 11 in 
jur.pap.36.xml shows that two different variants of the feminine article are possible 
at that point of the text, being one the original text on the actual papyrus and the 


14 Cf. REGGIANI 2017, 237 and N. Reggiani in this volume, $ 4.6. 
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other one the modern ‘regularization’ of the non-‘standard’ spelling. When tokeniz- 
ing, one should integrate the variants into the text one by one. 

All the details about the preprocessing of a papyrological text aimed to clean it up 
before tokenization are contained in the XQuery module. More precisely, the function 
lp:clean-markup () deletes those elements which do not contain data text (such as 
the <note> or <bib1> elements) and shows the logic followed as for those elements 
containing mutually exclusive data text alternatives, such as the choice element. The 
XQuery module also details the rules for the tokenization itself. One tokenization problem 
which has not been dealt with in the present study is the case of those words splitted on 
different lines, as in o.narm.3.xml, where the word xodaxeiag is splitted into xoAo (line- 
break 2) and xe (xc (linebreak 3). The XQuery algorithm treats the two divided parts as 
different tokens, in that they are identified by a different line break number (8n in<1b/>), 
which is taken to be a major identifying reference point for a token (i.e., a token cannot 
span over two linebreaks). Future work should try to address this problem. 


4 POStagging 


POS tagging is the NLP task aimed to add morphological analyses to tokens. A model 
for the MATE tagger has been trained on the AGDT, so it is possible to tag Ancient 
Greek texts automatically. The model has been trained on literary texts, i.e. texts 
whose language is quite different from that of papyri. 

The token accuracy for POS tagging using the MATE tagger has been compared 
to the baseline accuracy consisting in assigning to each token the most frequent mor- 
phological tag associated to it in the AGDT. As explained in Section 2, the comparison 
is performed using the 204 matching sentences of the Sematia/ MALP comparison cor- 
pus as a gold standard. The total number of comparable tokens is 1,839. The word 
forms of the tokens of each sentence have been compared as for their POS tags and 
morphological features. 


Baseline MATE tagger 
0.56 0.62 


Tab. 1: token accuracy for POS tag + morphological features. 


Baseline MATE tagger 
0.67 0.76 


Tab. 2: token accuracy for POS tag only. 


An Automatic Morphological Annotation and Lemmatization for the IDP Papyri — 145 


In order to calculate the baseline accuracy we used the most frequent tags found in 
the AGDT and, ifa word form is not present, the first POS tag + morphological features 
found in the Morpheus dictionary for that given word form. There is no clear order of 
entries for each token in the Morpheus dictionary, so this latter POS tag + morpholog- 
ical features assignment can be considered random. 

The tables above show that having a POS tagger both for POS tags only or POS 
tags + morphological features is useful to get better accuracies, even if the figures are 
admittedly not very high. 

One explanation for these low accuracies is that the vocabulary found in papyro- 
logical texts is very different from the one found in the AGDT: the distinct values of 
the tokens of the 204 matching sentences contained in the Sematia/MALP compari- 
son corpora are 781, of which only 361 are found in the AGDT data (used to train the 
MATE tagger). Similarly, the Morpheus morphological analyzer can recognize only 
395 tokens (out of the 781 distinct values). This is clear evidence that papyrological 
texts contain very different vocabulary from literary texts, which deserve special at- 
tention and development of specific resources. 


5 Lemmatization 


An attempt of lemmatization is performed using a database consisting of entries from 
the Morpheus morphological analyzer/dictionary and the Perseus-Under-Philologic 
morphological dictionary (available but not downloadable at http://perseus.uchi- 
cago.edu), which is a refinement of Morpheus, with correction of its errors and addi- 
tion of new entries. 

A typical entry of the database is like the following: 


«d n="500083" v="a-s---fa-#auetagivotov"> 
<p>a-s---fa-</p> 
<f>apetaxivntov</f> 
<l r="a--s---fa-">dapetaxivyntoc</1> 
<e>dapetaxivntoc</e> 
</d> 


Each d element is identified by a word form + morphological analysis, which a lemma 
from the Perseus-Under-Philologic dictionary (in the 1 element) and/or the Morpheus 
dictionary (in the e element) are associated to. I extracted from this database all the 
lemmas of those entries whose word forms and morphological analyses correspond 
to those of the MALP tokens (belonging to the 204 matching sentences). 
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Lemmatization 
0.47 


Tab. 3: token accuracy for lemmatization. 


The result of the comparison between the lemmas in the Sematia sentences and 
the ones in the MALP sentences (containing lemmas added using the above men- 
tioned database) is as low as 0.47. This result seems to be due to the low accuracy of 
POS tagging because the lemmatization process heavily relies on it: if the morpholog- 
ical tag associated to a word form is wrong, then the correct lemma is unlikely to be 
retrieved from the database. Another reason can be the fact that the vocabulary cov- 
ered by the dictionaries is not complete: however, if the Morpheus dictionary only 
recognizes 395 tokens of the distinct values of the Sematia/MALP matching sen- 
tences, the Perseus-Under-Philologic dictionary has a much better coverage, which 
amounts to 582 tokens. 


6 Conclusions 


In this paper I have documented an attempt to add morphological annotation and 
lemmatization to the Papyri.info corpus automatically. The results show that fine- 
grained POS tagging accuracy is 0.62, while lemmatization accuracy is as low as 0.47. 
The fine-grained POS tagging accuracy is likely to be explained because of the vocab- 
ulary difference/coverage between the literary texts of the AGDT, which the MATE 
tagger has been trained on, and the papyrological texts (the AGDT recognizes only 
361 tokens of the 781 distinct tokens of the Sematia/MALP matching sentences). This 
has a direct consequence on the lemmatization accuracy, which has been performed 
using a rule-based approach depending on the POS tagging: if the association word 
form-POS tag is wrong, then it is very unlikely that the right lemma can be retrieved. 
In order to get better results both for POS tagging and lemmatization, a papyri-spe- 
cific treebank, such as the Sematia treebank, should be expanded, so that it can pro- 
vide enough data for the training of new models (for POS tagging/lemmatiza- 
tion/parsing). Even if the accuracies calculated for the MALP corpus are not high, still 
the corpus is expected to be useful for data exploration. 
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1 Introduction 


Texts are made of words, the meaning of the words creates the text, so that under- 
standing the words means understanding the text. This assertion may sound obvious, 
but it actually hides a deep truth: reaching a more exact definition of the words can 
help to reach a more exact meaning of the whole passage in which they occur. Hence, 
the need for creating tools specifically conceived to study the words and their tight 
connection with the texts. In the digital and internet-led era we are living in, elec- 
tronic technologies have been profitably applied to the Humanities and intersect with 
them in the scholarly field of the Digital Humanities (DH) to such an extent that it is 
hard to disagree with Jerome McGann’s incisive words: 


As with the Renaissance sped forward by the printing revolution of the fifteenth century, digital 
technology is driving a radical shift in humanities scholarship and education. The depth and 
character of the change can be measured by one simple but profound fact: the entirety of our 
cultural inheritance will have be reorganized and re-edited within a digital horizon.! 


The present contribution falls into the DIGMEDTEXT project (ERC-2013-AdG no. 339828) funded by the 
European Research Council at the University of Parma (Principal Investigator: Prof. Isabella Andorlini; 
see http://www.papirologia.unipr.it/ERC), but the work for this paper has been completed during my 
current Post-doctoral Fellowship at the North-West University of Potchefstroom, South Africa. 


1 MCGANN 2010, 1. For a good overview of the issues entailed by Digital Humanities, cf. SCHREIBMAN 
— SIEMENS — UNSWORTH 2004. 


8 Open Access. © 2018 Isabella Bonati, published by De Gruyter. [CERANO] This work is licensed under the 
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. 
https://doi.org/10.1515/9783110547450-009 
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In this context, online dictionaries and reference tools, as results of an e-lexico- 
graphic process,’ may acquire a special relevance and potential, notably if these re- 
sources are linked to a corpus of digital editions of texts. This is precisely the case 
with Medicalia Online, a digital lexicographical database of technical terms attested 
in the Greek medical papyri.? Medicalia Online is indeed strictly connected to the dig- 
ital editions hosted in the Corpus of the Greek Medical Papyri Online (CPGM),^ the dig- 
ital library of ancient medical texts on papyrus recently merged into the Digital Corpus 
of Literary Papyrology (DCLP).5 Like the CPGM project, Medicalia Online has been de- 
veloped at the University of Parma (Italy), from 2014 to the end of 2016, in the frame- 
work of the ERC project DIGMEDTEXT funded by the European Research Council 
(Grant Agreement no. 339828) and directed by Professor Isabella Andorlini.$ Thus, 
given the close interconnection with the core database, Medicalia Online can be con- 
sidered as both a supplement to and an expansion of the digitized corpus of the Greek 
medical papyri, as I will illustrate below. 


1.1 A matter of definition 


Medical papyri represent a corpus of peculiar texts with a peculiar nature, that ranges 
from literary texts, notably treatises by known authors and adespota, since papyrus 
fragments not rarely preserve works more or less of the same status as the medical 
literature transmitted in medieval manuscripts, to technical texts conceived to con- 
vey technical knowledge, for instance technical handbooks, collections of recipes, 


2 To explore the state of the art in the field of (e-)lexicography, see TARP 2008, FUERTES-OLIVERA — 
BERGENHOLTZ 2011; FUERTES-OLIVERA 2013; FUERTES-OLIVERA — TARP 2014. 

3 Cf. http://www.papirologia.unipr.it/ CPGM/medicalia/vocab/index.php. See also the following 
contributions: BONATI 2018a and 2018b, as well as REGGIANI 2017, 129—30 and 275. 

4 On the CPGM Online, see the contribution by N. Reggiani in this same volume. Cf. also REGGIANI 
2015 and 2018a. For a full insight of a typical CPGM digital edition, see BERTONAZZI 2018. For other 
references on digital papyrology and online resources and project, see in particular: ANDORLINI 1997a, 
1997b, as well as RAMSAY 2004; DEL CORSO 2007; MAGNANI 2008; ANDORLINI — REGGIANI 2012; DELATTRE 
— HEILPORN 2014; DEPAUW — GHELDOF 2014; REGGIANI 2012, 2017 2018b; SVENSSON — GOLDBERG 2015. 

5 Cf. http://www.litpap.info. Cf. REGGIANI 2017, 251-3. 

6 Main reference website: http://www.papirologia.unipr.it/ERC. Cf. REGGIANI 2017, 256 and 273-5. 
The very first steps in the creation of the Medicalia Online database date back earlier, when I spent a 
research stay at the University of Oslo (from August 2012 to April 2013), supported by an Yggdrasil 
Grant from the Norwegian Research Council. There, I had the pleasure to work in close collaboration 
with Prof. Anastasia Maravela, who gave fundamental suggestions and a great contribution to the 
development of the current layout of the entries. During that first phase, I focused on the vocabulary 
of some representative Greek medical containers (on the topic see BONATI 2016a), which I used as 
samples to test advantages and disadvantages, usefulness and usability of the lemmas in Medicalia 
Online. Beside me and Prof. Maravela, other contributors in the project have been Prof. Isabella An- 
dorlini, Dr. Nicola Reggiani and Dr. Francesca Bertonazzi. 
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school manuals and catechisms, to proper documentary texts, such as public physi- 
cians’ reports, petitions of private individuals and private correspondence concern- 
ing matters of health and diseases. So, besides the strictly literary texts, such a corpus 
mostly includes texts with a ‘borderline’ character, viz. combining features of papy- 
rus documents with issues proper of the technical nature of medical writings, that are 
categorized — as in a sort of ‘twilight zone’ — as ‘paraliterary’ or — in a vaguely pejora- 
tive way - ‘subliterary’.’ Due to this complex and stimulating textual situation, it be- 
came clear since the beginning that a dictionary of short definitions of terms would 
not have fitted the exegetical requirements of ancient medical discourse. This en- 
tailed the necessity to broaden the goal of Medicalia Online to produce a rigorous and 
detailed reference collection of relevant lemmas critically discussed. An extensive 
and diachronic treatment of ancient Greek medical terms was indeed still missing 
from the scholarly landscape. It was decided to focus the attention on a selection of 
specimina with the aim of providing not merely brief explanations of many words, 
like in an ordinary ‘dictionary’, but a series of in-depth studies on selected terms. As 
a consequence, it is preferable to define Medicalia Online not as a simple ‘glossary’ or 
‘dictionary’, but rather as a ‘lexicographical tool? containing entries or articles with 
an encyclopaedic flavour, or, even more specifically, as a ‘specialized lexicographical 
tool’, being it devoted to the specific set of linguistic and factual elements of the spe- 
cialist subject field of ancient medicine. 

Considering the vastness of the lexical material at disposal, since the papyri are 
a treasure-trove of linguistic information, the lexicographical process is still ongoing 
and potentially never-ending. Due to this aspect, Medicalia Online may fall into the 
category of the lexicographical tools “under construction" or better “dynamic”,’ ac- 
cording to the terms “Ausbauwérterbuch” (“dictionary under construction") vs. “Ab- 
schlussworterbuch” (“completed dictionary") introduced by SCHRÖDER 1997, 60, and 
“dynamisches Wörterbuch” (“dynamic dictionary") vs. "statisches Wörterbuch” 
(“static dictionary") preferred by LEMBERG 2001, 81. This means that Medicalia Online 
is not “a fixed object", but a flexible entity, “an organic changing database"? that can 


7 Cf. REGGIANI 2017, 78. 

8 For a definition of ‘lexicographical tool’, used instead of ‘reference work' to express “a superior 
concept for both printed and electronic dictionaries", see TARP 2008, 123: “a lexicographical tool is a 
tool that can be used via consultation or passive searching by users with a specific type of communi- 
cative or cognitive need to gain access to lexicographical data, from which they can extract the type 
of information required to cover their specific needs". 

9 On computer-lexicographical process for online dictionaries under construction, cf. KLOSA 2013, 
519-22. 

10 PRINSLOO 2001, 141. 
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be continually enlarged, as well as that its lexicographical process is an “open sys- 
tem”.! Such a flexibility and the possibility of a constant improvement and updating 
represent the undeniable advantage of an online publication. Nevertheless, most of 
the entries already published online in the Medicalia Online database are going to be 
published also in print, in form of a collection of lexical studies, as part of a volume." 


1.2 Methodology and aims 


There are some keywords characterizing the methodology of Medicalia Online. The 
first and most important one is ‘interdisciplinarity’. The significance of interdiscipli- 
narity in the new trends of Papyrology has been stressed several times in recent years. 
Suffice it to remember how often expressions like “broader concept” and “broader 
view”, as well as words like “combination” of sources, and, of course, “interdiscipli- 
narity” occur in The Oxford Handbook of Papyrology edited by Roger Bagnall in 2009. 
So, whilst the core focus of Medicalia Online is papyrological and the evidence of med- 
ical papyri plays a leading role, a systematically interdisciplinary approach inspires 
the inner nature of its lexical studies. This contributes to broaden the horizon of the 
database to a wide range of perspectives, since it provides at the same time a papyro- 
logical, linguistic, archaeological and historical-scientific overview of the studied 
items. Such a methodology, indeed, involves a strong sense of dialogue and cooper- 
ation among disciplines, merging and combining components of several subject ar- 
eas beside papyrology: classics and history of textual transmission, digital humani- 
ties, linguistics, epigraphy, archaeology and material culture, history of science and 
of medical practices across the ages. It entails a critical analysis and a comparative 
examination of all the typologies of sources on which the study of the ancient medi- 
cine draws upon, from the written ones, i.e. papyri, literary passages (first and fore- 
most works on medical topic, but also any other author in which the terms appear), 
inscriptions and tituli picti, to the available archaeological discoveries attesting to 
medical practice. Ultimately, this integrated approach, which bridges together the 
main subject areas in ancient studies, enables us to throw new light on the complex 
and multicultural setting of the Greco-Roman medicine in Egypt, and presents - to 


11 Cf. KLOSA 2013, 519: “producing an online dictionary may begin before the phase of writing is fin- 
ished: online dictionaries can be published step-by-step. Thus, all phases of the computer-lexico- 
graphical process (planning - writing — producing) merge, giving yet unknown flexibility to the lexi- 
cographer. [...] While other lexicographic processes lead to an end (i.e. the publication of the 
dictionary), theoretically, working on an online dictionary under construction could go on forever. 
An online dictionary under construction is an open system”. 

12 Cf. BONATI— MARAVELA 2018. 
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borrow Vivian Nutton's words - “an inclusive model for understanding the medical 
word of Antiquity". 

The second keyword of Medicalia Online is *verticality'. Such a comparative and 
thorough, i.e. ‘vertical’, approach to the sources contributes to improve our under- 
standing of the Ancient World with its textual and concrete aspects, with its verba 
and its realia, and promotes an essentially ‘vertical’ rather than ‘horizontal’ dimen- 
sion in investigating lexical items. Thus, this sort of ‘archaeology of the words’ also 
translates into the effort to ‘revitalize’ the past, making it more accessible to the pre- 
sent time. 

Furthermore, starting from the evidence of the papyri, particular attention is de- 
voted to the evolution and survival of the examined words into the modern languages 
and contemporary scientific discourse. Thus, one of the goals of Medicalia Online is 
to focus on the diachronic, often problematic developments of the Greek technical 
vocabulary tracing its trajectory from antiquity to modern times. 

A further aspect concerns the analysis of the relationship, viz. the points of diver- 
gence and contact, between the terminology attested by the papyri of medical content 
and the - often more sophisticated - technical language known through the medieval 
manuscript transmission of the ancient medical writers, from the Hippocratic authors 
to the compendiasts of Late Antiquity. In this view, the lexical studies of Medicalia 
Online allow us to explore the contribution of the papyri to our knowledge of the 
Greek medical language. 


1.3 The database and the entries 


The database is built on the open source vocabulary server TemaTres and is browsa- 
ble in different ways. The home page displays a threefold subdivision by macro-cate- 
gories, each of which is further divided into subcategories providing a taxonomical 
classification of the terms: “Lexicalia”, i.e. word typologies (e.g. containers, ingredi- 
ents, instruments, termini technici), “Medical branches" (e.g. gynecology, ophthal- 
mology, pathology, pharmacology, surgery), “Text typologies" (e.g. adespota, cate- 
chism, documentary texts, prescription). A single term can also be subordinated to 
two or more subcategories, so that it can be searchable in each of them. Another way 
is to browse the terms and the categories alphabetically by clicking a certain Latin or 
Greek letter either at the top or at the left bottom of the home page. Finally, on the 
very top, it is also possible to use a full-text search, as well as an “Advanced search". 
In the latter case, a drop-down menu provides a submenu of navigation items to select 
the research scope: “Term” restricts the search to the headwords, *Meta-term" to the 


13 NUTTON 2004, 16. 
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categories, “Non-preferred term” to secondary headwords such as variants and di- 
minutives, and “Note” to the thematic boxes. 


Medicalia Online 


AB C DEGNI CHK OPaARETFUABEKA 0 NI TY 
© xw 
» «Lexicalia» 

«Medical branches» 


» «Text typologies» 


00606000800000 Autor IGAAEOITENT rjc (ORC 0 
L Author: DIGMEOTEXT Project (ERC GA 339628. PI 
B Prof. Isabella Andorirs 
090006000008000 
[engen bal 


Fig. 1: The database home page. 


Medicalia Online 


Advanced search 


What search? 
Search term 

Exact phrase 

Mas this top term 
Note type 

Created on or after 


Is located in deep 
level 


00606000800000 Acer: DIGUEDTEXT Proje (ERC G 
590000800008000 


Fig. 2: The advanced search inferface. 
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The lexicographical structure of the lemmas is innovative and reflects the interdisci- 

plinary, integrated approach that inspires Medicalia Online. The layout of the entries 

is conceived to offer a broad overview of the examined words and is essentially com- 
prehensive, but, at the same time, it is user-friendly and applicable to any lexical cat- 
egory and semantic field. User-friendliness is indeed an important prerequisite when 

a lexicographical tool is complex and involves a conspicuous bulk of information. 

Each lexical entry consists of a fixed set of thematic boxes (“notes”), as follows: 
“Variants” includes a list of variants, both grammatical (e.g. diminutives) and 
phonetic/spelling variants as found in the papyri, the Latin transliteration or 
form(s) of the term, and the cognates of medical relevance, if any; 

— “General definition” gives a dictionary-like definition usefulto provide thereader 
with the main information concerning the searched term and its ‘immediate’ 
meaning before (or in case) (s)he goes on reading through the full lemma; 

— “Language between text and context” is a linguistic section containing discus- 
sions on etymology, morphology, semantics, variants and cognates of the exam- 
ined term, but it also discusses its linguistic history up to modern times and the 
diachronic developments of its technical meaning(s); 

— “Testimonia — a selection of representative sources” lists some Greek and Latin 
passages from all kind of written sources (literature, papyri, inscriptions) in 
which the term is attested, selected according to their medical relevance. Each 
passage is accompanied by an English translation; 

— “Commentary” is the most substantial section of the entry and is aimed at con- 
textualizing the term in its textual and historical-scientific background. In order 
to do this, the section is divided into two chapters. The first one (*[the term] and 
its medical sources") traces a detailed overview of what the ancient sources attest 
about the term, also scrutinizing the possible changes of its semantic value over 
time from its earliest attestations to Late Antiquity, and the comparison between 
its ancient and modern meaning(s). The second chapter (“[the term] in practice") 
is specifically focused on the ‘practical’ side of the examined item and outlines 
the connection between the word and its concrete dimension. To make just some 
examples, this means the material reconstruction of the related object in case of 
words denoting res medicae, such as containers employed to prepare or store 
remedies or surgical implements, and the methods of treatment and surgical pro- 
cedures to be performed when dealing with names of pathologies and disorders, 
with particular attention to parallels, divergences and innovations along the his- 
tory of medicine; 

— “Bibliography” includes "Lexicon entries", i.e. dictionaries, glossaries etc., and 
*Secondary literature", i.e. more extensive studies on that particular topic or 
word. 

— “CPGM/DDbDP reference(s)" lists the papyrological evidence containing the 
word. Since some of the examined terms occur only in the CPGM Online, while 
others appear also or only in documentary texts dealing with medical topics, such 
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as private letters requesting remedies or surgical instruments, which are con- 
tained in the DDbDP, “the documentary evidence (‘DDbDP references’) will be 
linked to the appropriate texts on Papyri.info, the literary or paraliterary one 
(‘CPGM references’) to the forthcoming texts on DCLP, from which, in turn, it will 
be possible to insert links back to Medicalia Online”. 


Finally, a clickable list of the terms connected to the main term (diminutives, vari- 
ants, cognates, Latin forms), which are also searchable through the alphabetical list 
that can be found both at the top and at the bottom of the home page, closes the 
lemma. 


2 The lexicographical database and the digital 
editions of texts 


2.1 The interconnection between Medicalia Online and the 
textual database 


The contribution of Greek and Latin papyri to our knowledge of classical languages 
is an indisputable fact that has been scholarly recognized since their discovery in the 
dry sands of Egypt in the late 19" century. It is worth quoting EVANS — OBBINK 2010, v: 


Every scrap of papyrus and every ostracon or tablet unearthed has the potential to change some 
aspects of the way we think about these languages. Such texts have the capacity to modify our 
understanding of the classical forms of both languages and for their post-classical development 
provide evidence of the most direct kind we shall ever acquire. The richness of the resource can 
hardly be overstated. 


Exactly like the other categories of papyri, papyri of medical content have a massive 
linguistic potential. The corpus of the Greek medical papyri has indeed not only en- 
hanced our knowledge of medical literature and everyday medical practice, revealing 
valuable information on the diseases that affected people in the Egyptian chora, as 
well as their pharmacological and surgical treatment. It has also offered rich attesta- 
tion of Greek technical vocabulary, its diachronic trajectory over time, its registers 
and levels of technicality, from the actual medical Greek written or spoken by medical 
professionals when communicating with their colleagues, to the not properly tech- 
nical but still medical language used in everyday life by lay persons and practising 
physicians.” 


14 REGGIANI 2017, 130. 
15 On the contribution of medical papyri to the study of medical Greek, cf. MARAVELA 2017. 
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In addition to the new attestation of technical terms already well known by 
means of the ‘official’ medical writings transmitted through the medieval manuscript 
transmission, medical texts on papyrus often refine our knowledge of weakly attested 
or extremely rare words and bring back to light elements of Greek medical vocabulary 
previously lost and completely unattested in other medical sources. 

An in-depth study of medical micro-language and technical terminology is thus 
essential to deeply understand the texts. For this fundamental reason, to join the dig- 
ital editions of the medical texts on papyrus with the lemmas in Medicalia Online can 
be of the utmost importance to promote an integrated and mutual enrichment. 

An even more in-depth investigation might be realized by adding to the lexical 
studies the analysis of the morpho-syntactic stucture of the texts by means of the ap- 
plication of different levels of linguistic annotation. Annotation is indeed a cardinal 
part of the linguistic analysis of a corpus of texts, the method of describing, recording 
and analysing linguistic phenomena through computer-based text corpora that is bet- 
ter known as Corpus Linguistics.! As stressed by REGGIANI 2016, 2: 


A linguistic corpus is usually intended as a selection of sample texts representative enough of a 
language, and though the medical papyri at our disposal come from a random and incomplete 
selection, they can be considered as the entire reference population rather than as a sample of a 
larger group, so that linguistic annotation seems to me absolutely feasible. The basic annotation 
layer, related to the analysis of the parts of speech (the one also known as treebanking because 
it is usually represented with a tree graph) would allow to conduct an extensive lexical, phrase- 
ological-formulaic and syntactic analysis on the corpus, aimed also (but not only) at discovering 
styles and writing strategies specific of the medical texts, both literary and documentary: think 
only of the possibility to find out influences or interpolations between authors, or the presence 
of literary echoes in technical or documentary texts. To analyse in depth and comprehend the 
syntactic structure of texts would allow also to solve problems of interpretation, or even only to 
understand the exact meaning of a text. 


Thus treebanking, as it is used in linguistics, is a possibility to model how sentences 
are built by creating morpho-syntactic trees. In the field of Classics this kind of lin- 
guistic annotation is now at a very advanced level.” Just to mention two relevant pro- 
jects, The Ancient Greek and Latin Dependency Treebank (AGDT 2.0) is a corpus of an- 
cient Greek and Latin literary works, annotated on the morpho-syntactic and 
semantic layers, which has been developed since 2006 at the Leipzig and Tufts Uni- 
versities by Giuseppe G.A. Celano, Greg Crane, Bridget Almas and others,” while, on 
the more strictly papyrological side, the project Sematia, conducted by Marja Vierros 
and Erik Henriksson at the University of Helsinki, is a platform aimed at facilitating 


16 On this issue, see for example BIBER — CONRAD — REPPEN 1998, FACCHINETTI 2007 and KUEBLER — 
ZINMEISTER 2014, as well as the chapters by N. Reggiani and M. Vierros in this volume. 

17 Cf. REGGIANI 2017, 180-6 with bibliography in n. 13. 

18 See at https://perseusdl.github.io/treebank data. See also G. Celano’s chapter in this volume. 
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the linguistic tagging of digitized documentary papyri through the creation of linguis- 
tic layers from TEI/EpiDoc XML documents.” 

It is worth mentioning another innovative text analysis tool, which might be po- 
tentially useful also in the case of the corpus of Greek medical papyri. I am referring 
to CATMA 5 (Computer Aided Text Markup and Analysis),? a tool developed at the 
University of Hamburg that offers an interesting combination of three main features, 
since it allows collaborative annotation and analysis of a text or a text corpus, it sup- 
ports explorative, ‘non-deterministic’ practices of text annotation, viz. a discursive 
and debate-oriented approach to text annotation based on the research practices of 
hermeneutic disciplines, and integrates text annotation and text analysis in a single 
web-based working environment. 


Fig. 3: CATMA screenshot 


One of the main outcomes provided by the digital tools is the possibility to support 
several kinds of linguistic analysis - lexical, semantic, morphological, syntactic — in 
direct interconnection with the texts, namely directly on the digitized textual editions. 
This allows an - even simultaneous — work of in-depth investigation, abstraction and 
conceptualization on and through the text itself, thus enhancing its deep comprehen- 
sion and interpretation in an immediate, dynamic and interactive way. Immediacy, 
dynamism and interactivity are indeed among the most stimulating features and per- 
spectives of a digital publication. In the case of Medicalia Online, its interconnection 


19 See at https://sematia.hum.helsinki.fi. See also M. Vierros' chapter in this volume 

20 See at http://catma.de. Thanks to the funding made available by the North-West University of 
Potchefstroom, I had the opportunity to be introduced to CATMA during the workshop “Digital An- 
notation and Analysis of Literary Texts. A hands-on introduction to CATMA”, organized by the South 
African Centre for Digital Language Resources (SADiLaR) and held by Prof. Christoph Meister at the 
University of Pretoria (August 21, 2017). 
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with the core textual database may exactly be defined with these adjectives: immedi- 
ate, dynamic and interactive. The expected systematic connection of the examined 
words occutring in the digital editions of the medical papyri to the related lexical en- 
tries in Medicalia Online? may help not only to refine the definition of a term in a 
particular papyrus text, but also to contextualize the term in all the other sources at- 
testing it. Medicalia Online is, thus, so conceived as to integrate the textual database, 
in order to serve as an expansion and a supplement to the digital editions of the Greek 
medical papyri. 

The issue of the integration and interconnection with the main database puts the 
stress once again on the contribution of a close lexicographical examination to an 
enhanced textual understanding. In particular, a thorough lexicographical approach 
can strengthen a double awareness of the examined lexical item within the frame- 
work of the text, as it will be illustrated by means of some selected case-studies: ‘con- 
crete’ on the one hand, ‘textual-philological’ on the other hand. 


2.2 Lexical studies and ‘concrete’ awareness 


As already said, the interdisciplinary and critically comparative methodology of Med- 
icalia Online enables us to explore also the practical and material dimension concern- 
ing the term and its context. This aspect makes it possible to almost physically ‘visu- 
alize’ the concrete reality under and beyond the words, especially thanks to the 
second section of the “Commentary” (C 2), which is explicitly focused on the concrete 
side of the lexical items (see $ 1.3). Thus, this sort of “archaeology of the words’, aimed 
at ‘revitalizing’ the past, makes the ancient texts themselves more ‘living’ and their 
words almost ‘tangible’. 

An example of an object virtually ‘reconstructed’ starting from the evidence of a 
papyrus is provided by the word ópia, commonly denoting a particular type of con- 
tainer used as a jar to carry and pour water.” This primary function, made clear by 
the etymology from béwp, “water”, is well established in the ancient sources. In the 
documentary papyri, this container is also filled with other contents, such as food- 
stuffs.” Only two passages of medical authors refer to the 06pío (and to the diminutive 


21 For the time being it is only possible to create links between the lemma (see section E, 
“DDbDP/CPGM references” mentioned above, $ 1.3) and the words highlighted in the front matter of 
the digital editions in DCLP, as well as, conversely, between the latter and the corresponding lemmas 
in Medicalia Online, but it is expected that in the near future it will be also possible to insert links 
directly into the Greek texts both in DCLP and in Papyri.info, in case of medical terms found in docu- 
mentary papyri of medical content. 

22 Cf. MedOn s.v. and BONATI 2016a, 157-75. 

23 Cf., e.g., PSIIV 428,89-90 and 92 (pomegranates), BGU XIII 2359,10 (beans), P.Oxy. I 155,4 (bread). 
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b6píokn) as a container of pharmaceutical use.” In both cases, the vessels have the 
function of small containers for ointments employed for the storage ofthe therapeutic 
products prior to their use. This function is attested in just one documentary papyrus 
of medical content dating back to the late IV century CE, P.Oxy. LIX 4001. The papyrus 
is a letter written by a certain Eudaemon to his mother, grandmothers and a woman 
called Cyra. Eudaemon, who is a doctor, sends the letter to his surgery presumably in 
Oxyrhynchus, as the address on the back shows (ártó60g siç 16 iatpeiov). He is working 
away from home for professional reasons, and he now asks for the means to make 
some medical implements on his own. He also notifies his family that he received a 
“hydria of eye-salves” instead of a “hydria of animal grease”.* The passages of medi- 
cal content do not provide any information about the physical appearance of the ves- 
sel, but the word vSpia corresponds to a well-recognized type of container in the (con- 
ventional) archaeological vocabulary, and it is one of the most common shapes 
depicted in Attic vase-painting, such as in the famous Francois vase (ca. 570 BC). Ac- 
cording to all types of ancient evidence, the most typical features of this vessel are the 
presence of some handles (usually three: two horizontal side-handles for easy lifting 
and a vertical handle for pouring the water or carrying when empty) and a fairly nar- 
row neck set off from the body. It is significant to stress that the comparative study of 
all the sources — written as well as archaeological — allows us to conjecture about the 
material shape of the dépia in the Oxyrhynchus papyrus, as well as, more broadly, in 
the other two medical passages, even though it does not appear any explicit infor- 
mation about its morphology. First of all, although in these texts of medical content 
vdpia / Vpioxn does not represent an actual terminus technicus in the vocabulary for 
medical containers, it is likely that the word has been used in connection with reme- 
dies having an ointment-like consistency because of the shape and the considerable 
versatility and manageability of this small vessel. Assuming that the D6pia / D6píoxn 
of the medical sources has a narrow neck and a vertical handle like some glass hydris- 
kai for cosmetics and oils found in archaeological contexts,” its morphology seems 


24 Cf. Gal. Comp.med.loc. I 2 (XII 437,2-5 K.) and Paul.Aeg. III 2,2,4-6 (CMG IX 1,132,19-21 Heiberg). 
25 Cf. ll. 22-30 ëoyape(v) | 8È kai Ta GAAG návta ywpiç nóvnç | tfi vapeiag (l. DSpíoç) Tob otvyyeiov 


vepeill . . Jav (l. b6ptav): “we had all the other things too except only the jar of grease. So let our 
brother Theodorus be eager to search for it ... [make sure to look ... ] and to know about it ... jar, he 
provided instead of the grease a jar of ointment". 

26 For these core-formed glass miniatures (hydriskai), imitating the shape of the three-handled and 
narrow-necked pottery hydria and very probably containing scented oils or cosmetics, see BONATI 
2016a, 172 and STERN 1999, 29-39. 
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to be particularly suitable for closing and sealing.”’C:\Users\Isabella\Dropbox\Medi- 
calia Parma\HYDRIA-final version.docx Thus, it is highly probable that these features 
represent the main reason why in the papyrus letter from Oxyrhynchus a little vessel 
named v8pía has been chosen to contain the therapeutic products mentioned: the 
xoAAovpia actually received by Eudaemon and the ó£byytov previously requested by 
him but never dispatched. In all likelihood, these special details of the D6pia attended 
to by Eudaemon's family will have assured proper preservation of the remedies dur- 
ing their transport from the iatpeiov in Oxyrhynchus to the village where Eudaemon 
was apparently working as a physician. 

A further example of this attempt to 'revitalize' and ‘reconstruct’ an object from 
the past may be represented by another name of container, qappako0r|kn.?? The term 
is a semantically transparent compound employed to denote a portable chest or case 
for the storage of remedies and medical implements. The earliest witness of the noun 
is P.Oslo II 54,6 (second half of the II — first half of the III century AD), a private letter 
on papyrus from Egypt (likely coming from the area of Oxyrhynchus), addressed by a 
certain Horeion to his father Apollonios. The term occurs otherwise exclusively in 
astronomical and Christian works from the V century AD onwards.? The lack of this 
term from the medical texts is significant and might suggest that, even if the com- 
pound qoppaxo8rikn indicates an exclusively medical container, viz. a technical ac- 
cessory, it never developed into an actual terminus technicus. It is likely that it served 
as a lay synonym for the physician's tool-case in common language, for instance in 
practitioners’ everyday conversations with their patients, instead of more ‘official’ 
and technical terms for similar objects used among professionals, such as våpðně,” a 
noun which is well known in literary sources, and explicitly defined vàpOn£ iatpikóg 
in aliterary papyrus containing alchemical recipes (PSI inv. 22011,48 = TM 65816), but 


27 This aspect finds a confirmation in the fact that some D6píou Eoppaytopévat used for transporting 
and sending products are mentioned in two documentary papyri, SB X 10559,1 (V AD, ?) and CPR XXV 
25,4 (VI-VII AD, Arsinoites or Herakleopolites). 

28 Cf. MedOn s.v., BONATI 2016b, 663-7 and 2016a, 185-95. 

29 Cf. e.g. Cat.Cod.Astr. I 104,26-30 Olivieri and Procl. Or. XVIII in laudem apostoli Pauli (PG LXV 
817D-820A Migne). 

30 It refers to a narrow cylindrical type of case, named after the Ferula communis (vap8né in Greek), 
with the wood of which it was originally made. This name continued to indicate such cases even when 
made of other materials, and came to designate boxes with different shapes and functions. For a dis- 
cussion on the term and for references to ancient sources and to the bibliography, see especially MAR- 
GANNE 2004a, 122-4. 
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also - though less attested — 5eAtdptov,* £yyeipiótov,? and rñpo.2 As a matter of fact, 
according to SCHIRONI 2010, 338: 


Technical terms often have lay synonyms in common language; this is particular evident in med- 
icine where technical and lay terminology coexist [...] and often physicians use the latter in order 
to be understood by the patients. 


Thus, it seems that the compound qoppaxko80rn might be placed at an intersection 
point between the language of everyday life often documented in the papyri and the 
technical vocabulary for medical containers. Furthermore, even though the extant 
written sources do not provide illuminating information about the shape and the 
physical appearance of pappakoðńkn, archaeological evidence may represent a fer- 
tile ground to start formulating hypotheses. Indeed, a certain number of cases for 
storage and transport of drugs and remedies and/or surgical implements have been 
unearthed in excavations. Among these, a characteristic typology is rectangular (on 
average 12 x 6-7 x 2-3 cm), equipped with a sliding outer lid and internally divided 
into compartments, each having its own hinged cover in order to store together dif- 
ferent medical substances with no risk of contamination. Assuming a terminological 
and typological overlapping between the gappaxo0fikar mentioned in the written 
sources and any of the archaeological boítes médicales, it is tempting to surmise that 
this compartmentalized rectangular type with sliding lid was the most suitable shape 
for the domestic pharmacy chest mentioned in the Oslo papyrus. This hypothesis re- 
lies on the association between the almost intuitive formation of the compound - lit- 
erally a case for medicines (påppakov + Orikr) - and the most common type of con- 
tainer with that function in both professional and non-professional contexts. 
Archaeological discoveries have offered several specimina dating back to the Roman 
period, therefore contemporary with the papyrus from the Oslo collection. Often very 
well preserved, and sometimes still holding residues of their pharmaceutical con- 
tents, some of these Ofjkat surely derive from a professional field, but a similarity in 
shape of their household counterpart may be likely presumed.? 


31 SeAtdpiov occurs with this meaning only in P.Oxy. LIX 4001,30-1. For a discussion on the term and 
for references to ancient sources, cf. ANDORLINI 1996, 7-8 (especially n. 5); FISCHER 1997; MARGANNE 
2004a, 124—5 and 2004b, 23-4 with bibliography, as well as pp. 31-3 (with n. 30) and pp. 38-40. 

32 Eyyeipiguov is a case - as the etymology suggests — suitable to be carried “in the hand”, likely 
similar in form to a SeAtdptov, cf. MARGANNE 2004a, 125—6. 

33 mrpa is listed in GMP II 10,6-7 (= P.Strasb. Copte inv. 563,6-7) tiv mpav | Tod iatpod with other 
medical instruments and glossed with the word éyxetpidtov by Ammon. Diff. 390,2 (101,5 Nickau), cf. 
MARGANNE 2004a, 126 and FISCHER BOVET 2009, 163-4. 

34 For some significant examples of this type of rectangular boxes for medical purpose, see in par- 
ticular BLIQUEZ 1994, 69 and 191 no. 296 (ills. 189-90); DENEFFE 1893, 37-8 with PI. 2 (ills. 1 and 6); 
MILNE 1907, 172-3 with PI. LIV; KÜNZL 1996, Abb. XXXIV. 
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Finally, in both of these illustrative cases, an operation of “archaeology of the 
words’, consisting in integrating written and material evidence, contributes to shed 
new light on the (hypothetical but plausible) physical reality hidden behind the an- 
cient words. 


2.3 Lexical studies and ‘textual-philological’ awareness 


A thorough analysis of a term in all its written attestations can bear huge potential by 
improving our awareness of the ancient texts and their — sometimes highly problem- 
atic — philological issues. In-depth lexical studies like those offered by Medicalia 
Online can indeed give a considerable contribution to promote the contextual and 
philological interpretation of medical texts, sometimes even resulting in new read- 
ings and corrections to the established editions. The critical exegesis of these texts is 
strictly dependent on the understanding of their technical terminology, to an extent 
that reaching a more exact definition of the words expressing the contents can en- 
hance the philological awareness of the textual context in which they occur. 

Furthermore, Medicalia Online allows both a ‘holistic’ and an ‘atomic’ approach 
to the object of study. The analysis is carried out in a ‘holistic’ way in the sense that 
the term is studied in all its occurrences in order to get the broadest possible overview 
of its meaning(s). At the same time, it is also performed in an ‘atomic’ way, so that the 
word is grasped in its verticality and all its details are accurately examined and un- 
derstood. This operation encourages a ‘double movement’: from the text(s) to the 
word - the ‘holistic’ study of the term in all the written sources attesting it leads toa 
deeper comprehension of the term itself —, and from the word to the text(s) and the 
textual tradition. This thorough knowledge of the word entails a better understanding 
chiefly of the medical text on papyrus that is the starting point of the lexical analysis, 
but also, very often, of the passages — or at least of some of them — by medical authors 
that use this word. This implies a more refined philological awareness that can also 
enable us to amend errors in the manuscript transmission. 

To illustrate the contribution of an in-depth analysis of the technical terms to the 
exegesis of the medical texts - whether medical papyri or passages by medical writers 
— it may be useful to take as a specimen the observations raised by the study of the 
word xa0£8pioc.? koO&6ptoc is an adjective literally meaning “of or for sitting” (LSJ? 
851 s.v.). It is derived from the name of the object that concretely receives the action 


35 Cf. MedOn s.v. These observations were presented as a guest lecture during a seminar of papyrol- 
ogy held at the University of Oslo (February 10, 2017) in the frame of the research project “Strength- 
ening Research Capacity in the Papyrus Collection of the University of Oslo Library” with the title 
“Place the patient in the sitting position...” The word xa0£ópioc in medical authors and medical papyri. 
Iam grateful to the participants for their constructive criticism and suggestions. In this connection, I 
want especially to thank Anastasia Maravela, Agnes Tóthné Mihálykó and Jens Mangerud. 
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of sitting, ka0£8pa (“seat”). As to the word formation, xa@éSpiog is a denominative 
adjective in —toç, suffix of Indo-European origin usually employed to form adjectives 
from noun-stems and productive during the entire history of ancient Greek, from the 
Homeric language to the Koine.” Like all the other derivatives of the noun xo0£6pa,* 
ka0édpiog seems to be quite late, since it does not appear before the II century AD. Its 
earliest extant attestations are two medical texts on papyrus, and this is the reason 
why the evidence of the papyri is particularly relevant in this case: P.Aberd. 11,10, an 
ophthalmological catechism of the II century AD (http://litpap.info/dclp/63332), and 
P.Ryl. III 529r,57 (and maybe ll. 70-1), a fragment from a papyrus codex of the late III 
century AD containing a surgical treatise concerned with the treatment of shoulder 
dislocation (http://litpap.info/dclp/59970). 

Starting from these early attestations, kaO£ópioc is an adjective appearing only in 
medical sources to denote the sitting position of the patient, so that it seems to bea 
genuine terminus technicus of the medical micro-language. Its technical value is 
strongly confirmed by the medical authors, from Oribasius to Paul of Aegina, where 
the adjective is mostly used in the description of different kinds of surgical opera- 
tions, but also in other medical contexts. The common formula is ko0£6piog + ó 
xópvov / nóoyov + oynpatitw vel sim., with ka0£6pioc, as a rule, in predicative pos- 
ition,” but the adjective is also associated with the noun oyñuo.?” 

In order to contextualize the topic, it might be relevant to stress that placing the 
patient in the proper position has been an important prerequisite for surgery since 
ancient times. Like nowadays, different physical positions were required for different 
procedures. Therefore, it was essential to identify the correct position for any given 
operation. The importance of surgical positioning emerges, for instance, from a pas- 
sage of the Hippocratic treatise De officina medici (Off. 2 = III 275-6 L.): 


tà 8' £c xeipoupyünv Kat’ intpeïov: 6 doBevéwv: ó 6püv: oi ónnpétarť tà Spyava: TO Pas Ókov: 
Okoç: 600° Ökwç' Skov TÒ oàpa, TA üppevo: 6 xpóvoc: 6 Tpórtoç: Ó TÓrtoç, 


36 Cf. CHANTRAINE 1933, 33-8. 

37 The noun ko0é6po, a compound of é5pa «seat, chair» (< katà + £6pa), is in itself a derivative in - 
pa of Étopio “seat oneself, sit” (LSJ? 478 s.v.), from the IE root *sed “sit down". All the derivatives of 
ka8£ó6pa seems to be of quite late formation, such as the neuter diminutive ka0£6póptov “little seat”, 
appearing in a private letter on papyrus, P.Oxy. VI 963 (II-III AD), and the compound xAtvoka0£ó6ptov 
“easy chair” (LSJ? 961 s.v.), gloss of kAtvtrip “couch” (LSJ? 961 s.v., cf. e.g. Et.M. 520,26-7 Kallierges). 

38 Cf. e.g. Orib. Coll. XLVI 11, 2,1-3,3 (CMG VI 2,1, 219,30—5 Raeder); Paul.Aeg. VI 8, 1,10-1 and 90, 
4,10-1, as well as 99, 2,1-6 and 101, 1,3-6 (CMG IX 2, 51,11-2 and 139,10-1, as well as 152,14-9 and 
156,19—22 Heiberg). 

39 Cf. e.g. Aét. XV 5,50-2 (19,15-7 Kostomiris) £v pèv oov t@ évepyev, oynuatıčéoðw 6 náoyuv 
KATAKEKAWEVOG, TO yàp Kagéspiov oyfpa £c Artogupiav váy ro niporpérner TOV náoyovra: “during 
the surgical operation the patient must be placed lying on the back, for the sitting position causes 
quickly the patient to faint". 
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in which the adverbs 6xov and òkwg express respectively “where and how” and refer 
to what precedes in the list, that is “the patient, the operator, the assistants, the in- 
struments, the light”.“° This aspect is also evident in a medical papyrus from Oxyrhyn- 
chus, P.Oxy. LXXIV 4972, dating back to the II-III cent. AD, which belongs to the cat- 
echistic genre and contains a systematic exposition of the divisions of surgery 
(http://litpap.info/dclp/119317). In ll. 3-7, the most correct positions for certain oper- 
ations are discussed and, in particular, it is mentioned when the patient is lying back 
on a sloped couch with either the head (avdppomov) or the feet (xatàpportov) raised 
higher: 


TÒ 62] oynpatixév got TO TO[V | Emrin]éiwv oynuatwv ionep | Stav] Agywpev dvaportov 
(I. àváppomov) ü | kaváp]onov (l. katápponov) oxnpativt (l. oynpatitew) tóv | kápv]ovta 


The position-based is that concerned with appropriate positions, as when we speak of position- 
ing the patient tilted up or tilted down. 
(transl. LEITH 2009, 63) 


The synchronous ‘holistic’ and ‘atomic’ study of the adjective ka0£ó8piog in all its 
occurrences led to suspect some textual uncertainties in two passages from late an- 
tique compendiasts. The first one, transmitted by Oribasius, is part of an excerpt 
taken from the work entitled ITepi ztotovuévov BonOnudtwv by Antyllus, a surgeon 
and physician of the II century AD. The passage concerns the most proper couch for 
the patient. It is reported that a tilted couch, i.e. a couch with the head higher than 
the feet, puts the patient under strain because (s)he is like seated on a chair, even 
if this position is suitable for those who suffer from head pain. The text printed by 
the Teubner editor of Oribasius, J. Raeder, runs as follows (Coll. IX 14,6,1-3 = CMG 
VI 1,2, 15,21-3 R.): 


f| (sc. KAivn) 6' åvápponoç o«ó6pa kórtou nontik, £okvia ka8s8ptov oxńpatı, roig 82 nepi 
KEMOATV oUk áváppooroc. 


The manuscript tradition is unanimous in recording xa0e6píou oxñuqri but U.C. 
Bussemaker and Ch. Daremberg in their previous edition of the Collectiones Medicae 
(Paris 1854, p. 310,4) make the emendation ka8edpiw oyńpartı. In the former case, the 
term in genitive is a noun, i.e. the neuter ko0£8piov, and is referred to the material 
object, a small chair.” But this juxtaposition is unparalleled, whereas the adjective 


40 As to the patient, Ókou means where and óxoç how the patient is placed, that is his position, cf. 
comm. ad l. by Littré (Paris 1841) 276. Cf. also Gal. In Hp. Off. 1 6 (XVIIIb 668,9-670,5 K.). 

41 Cf. ThGL V 770D s.v. ka8edptov. 

42 The only certain medical attestation of the neuter noun xa8é6ptov is Sor. Gyn. II 37,5,1-4 (CMG IV, 
80,21-4 Ilberg) pikpòv 8’ év toig &ykóAotg adtò npo6takatéyovoa PETA TO ovppérpou petaoyeiv 
yóAakrog Komitétw Kad’ otac vrte8ei&opev Koitng, npokúntov Bè Kal éykNivov xageSpiw, where 
ka8esptov represents the stool on which the woman who breastfeeds the newborn is seated and bends 
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KadEedptos + oyfipa is certainly attested, as aforementioned, and appears in dative also 
in another excerpt from the same work by Antyllus (cf. Coll. VI 23,9,4-10,1 = CMG VI 
1,1, 181,23-4 Raeder: tv 8‘ amupétwv (Tobe ...... ) xa0egpiw oyńpatı aiwpntéov). 
Thus, despite the consensus of the manuscripts, the study of all the attestations of the 
term leads to approve, on a grounded basis, the correction made by Bussemaker and 
Daremberg. 

Also in a passage from Paul of Aegina, in a chapter on the insertion of the catheter 
in a male bladder, the occurrence of the term seems to conceal an element of philolog- 
ical uncertainty. The text edited by J.L. Heiberg (VI 59,1,9 = CMG IX 2,98,10 H.) is tov 6š 
kduvovta oynpaticavteg eic ka8£6piov, literally “having placed the patient on a seat”. 
According to Heiberg’s apparatus, the only divergence in the manuscript tradition 
should be the presence of Kai following xa8£6ptov in two codices of the XIV century 
from Paris (D, cod. Paris. Gr. 2208, and F, cod. Paris. Gr. 2292). However, it might be 
relevant that the second hand of a codex of the X century, V (cod. Paris. Gr. suppl. 446), 
deleted gig. This deletion might reflect the common expression with koO£ó6ptiog as an 
adjective (tov SÈ kduvovta oynpaticavteg ko0š6piov), whereas in the text transmitted 
by the manuscript tradition and printed by Heiberg ka0£6piov is a noun and is unpar- 
alleled in this formula. Therefore, in case the deletion made by V?is correct — as it seems 
to be given the occurrence of the term in similar expressions —, the adjective xo0£6pioq 
would acquire its common predicative value, even if it is usually placed before ó 
kduvwv / náoywv, and not after, as it would be in this passage. 

Even more significant from the textual viewpoint are the aforementioned med- 
ical papyri attesting the word. In the earliest papyrus, P.Aberd. 11,” the term ap- 
pears in the discussion on pterygium surgery immediately before the lacuna at 1. 10 
(cf. 11. 9-13): 


xetp[oupyeia Tob mtepuyeiou.] | petà Tov KabEspeto[v dvta Tov nóoyovra, £k] | rob óqO0oAuo0 
8up[uñ BA&popa SiaotTeiAavtes] | TO ntepúysiov Silexpavodpev &yxi]|orpetou peAbvnv [6 Aivov 
xai Tpixa irutetov] 


forwards. In such a case, the term bears no technical value, being a simple (and generic) stool. More- 
over, the neuter ka0£8piov occurs only twice in late non-medical texts. In Zonar. è 524,27 Tittmann 
s.v. Gi£Opiov the term kaðéðpıov is used as the interpretamentum of 5té6ptov, a compound of é6pa 
meaning a “seat for two persons" (LSJ? 423 s.v., cf. also Suda 8 896,1-3 Adler s.v.). It is probable that 
here the gloss ko0&6piov simply represents a generic (though not recurring) word for a “seat”, without 
a more specific correlation with 8i£8ptov. In schol. A. Th. 454h (209,7 Smith) tàv nopO0eviküv 
KadeSpiwv: E5wAtov BÈ kupiwg Ó (uyoç tñç vnóç, the juxtaposition rtapOevikàv xadespiwv appears to 
be the explanation of Th. 454-5 nwAtk@v / 0' E5wAiwv. Thus, the plural of ka0£6ptov — as well as of 
ko8£6pa — seems to acquire the extended semantic value of “bower, abode”, that is one of the mean- 
ings of the plural £5wAta (cf. LSJ? 478 s.v.), even though the second part of the scholium refers to the 
meaning of &60t0v as “rowers’ benches”. 

43 Ed.pr. TURNER 1939, 13. The text was republished (ed.alt.) in MARGANNE 1994, 104-11. 
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Surgery of pterygium. After the patient is seated, having separated the two eyelids of the eye, 
we will isolate the pterygium from the eye with a hook, and a needle having a thread and a 
horsehair... 


The restoration 6vta Tov raoyovta, with the participle dependent on perá, “after” in 
temporal sense, is a likely supplement, but also kápvovtra may be considered.“ On 
the one hand, the verb iui is unparalleled in this expression, on the other hand the 
space is not enough for the usual oynpatitovta, and this makes the presence of óvra 
plausible. Moreover, the article before the adjective seems to suggest an attributive 
value, whereas in the medical authors, as already seen, xa8€Sptos is always in predi- 
cative position. Different is the restoration proposed by TURNER 1939, 13: peta Tov 
xa0£ópeio[v Biov ?]. It was very probably influenced by a passage by Soranus (Gyn. I 
27,3,4 = CMG IV 18,2 Ilberg: ka0£8piov Stdyetv piov), the only author who uses the 
adjective ka0£6pioc with the meaning of “sedentary” (LSJ? 851 s.v. ka0é6puoç 2), but it 
absolutely makes no sense in this context. This last case strongly illustrate how a good 
lexical awareness of the semantic values of a word in all its occurrences and kinds of 
sources can contribute to a better philological understanding of the text itself. 
Philologically challenging and stimulating are the ll. 66-76 of P.Ryl. III 529r (col. ii): 


oi] | pèv dMot KaGE[Sptov , . . ]|woav Tov kép[vovta] oxn||paTIGetv. ñust[ç 62 , . . . . ] | kexAtpévol] . 1 
` v ^ tó "AMe£áv]|6piov. oxňpá got Svload]lyéotatov, tò SÈ KkexA[wé]|vov doparéotepov. 
alopa]|Agotepov Se ote pèlv drt] |ov oxnpaTifetv Tov [náo]|xovta, OTE 8š mpnvii. 


The other physicians put the patient in the sitting position, whereas we lay him/her down: the 
“Alexandrian position” is extremely painful, while the lying down position is safer. It is safer to 
place the patient sometimes on his/her back, sometimes on his/her stomach. 


The Rylands papyrus was first published by ROBERTS 1938, 158-62 (ed.pr.), then re- 
published by MARGANNE 1998 110-47 (ed.alt.) with some textual differences, as in the 
case of 1. 67. Here the adjective ka0£6piogc was very plausibly restored by MARGANNE 
1998, 112 and 117, whereas the editor princeps (ROBERTS 1938, 160) simply printed 
Katel. The term is mentioned in the earliest extant discussion on the best and less 
painful position in which to place the patient before performing the reduction of 
his/her dislocated shoulder. Indeed, in Greek medical sources the sitting position, the 
xadédpiov oyfua, is sometimes explicitly rejected or preferred to other positions, es- 
pecially the supine one. 

A first observation on this passage: given the end woav in 1. 68 and the context, 
one could expect the first aorist active indicative of a verb of thinking/saying (e.g. 
“think”, “claim”, “advise”) in the lacuna of 1. 67, followed by the infinitive oynpativetv 
in ll. 68-9. Considering the previous presence of xa0£6piov and what seems to be the 
space left at the end of the lacuna, which is measured with three dots in Marganne's 


44 Cf. MARGANNE 1994, 106. 
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edition, a plausible supplement might be perhaps néi]|woav from d&t6w, with the 
meaning “to think”, i.e. àÀAot xagg[Spiov néi]|woav Tov xáp[vovra] oxn||uoriÇeuv. In- 
terestingly, ROBERTS 1938, 16, though without any supplement to the text, translates: 
“other authorities (advise) that the patient be disposed in a ... position”. 

Moreover, at the end of 1. 69 before the break, possible supplements might be: 
fipei[g 6€ p&AAOv] | xexAévol[. J v ’ or nuei[g 8€ Kata]|kexApeévol], ] v ^, cf. Aét. XV 
5,50-1 (19,15-6 Kostomiris) oynpatiféo8w ó náoyuv KaTAKEKAILEVOS. 

A more puzzling textual issue concerns ll. 70-1. The context (Il. 66-81) is the de- 
scription of a method of reduction opposed to the procedures of other physicians or 
schools and it is specified that others place the patient seated; on the contrary, the 
author recommends lying him down. According to the restoration made in ll. 70-1 by 
both Roberts and Marganne, i.e. ’AAle&dv]|Sptov, the so called 'AAe£&áv6ptov oyfipa is 
defined as jvoaXyéotatov, whereas the lying-down position is regarded as less painful 
and safer. Marganne (p. 129) considers the 'AAe£ávópiov oyfipia as the same as the 
sitting position mentioned in l. 67, and not as a different posture. In such a case, the 
second statement (tò 'AMe&àv]|8ptov oyfipà otv Svu[oaA]lyéotatov, tò 6 KeKAll- 
p£]|vov àoqaAéorepov) would seem to explain the reason for the author's preference 
for the lying-down position, as expressed in the previous lines (oi] | u£v GAXor ka0£- 
[Sptov . . . ]|woav tov káp[vovro] oxn||paTiGetv. rpei[ 88 , . . . . ] | kexiupévo[ , T" v ^). 

The point is that there is no attestation of the 'AAe£ávópiov oyfipa in medical lit- 
erature, and Withington's suggestion” that the 'AAe£ávépiov oyfjpia may correspond 
to the position on the Thessalian straightbacked chair, the péya £60; Oeooañıkóv, 
used for dislocations and mentioned by Hippocrates (Art. 7,36-43 = IV 92,10-94,1L., 
cf. also Gal. In Hp. Art. 122 = XVIIIa 344,1-345,8 K.), seems unmotivated.^ 

The reading 'AA[c£áv]|6ptov, accepted by MARGANNE 1998, 112, was first restored 
by ROBERTS 1938, 160, who did not understand the presence of xa8¢[8ptov in 1. 67, as 
aforesaid. But a collation with the digital image of the papyrus" has raised difficulties 
with the supplement ’AA[e&dv]|Sptov, inasmuch the traces are incompatible with the 
usual shape of a and À. At the break, indeed, part of a small horizontal trace survives, 
consistent with the lower trait of an a. It is preceded by what seems to belong to a 


45 Cf. ROBERTS 1938, 162. 

46 Cf. MARGANNE 1998, 129 n. 29: “Le remarque de Withington [...] est superflue. Il est vrai que ni lui, 
ni Roberts n'avaient restitué, à la 1. 67, xa8&[6piov" . 

47 The only available image of this papyrus is black and white and is stored in the photographic 
archive of CEDOPAL (University of Liége). Under my request Prof. Marganne, to whom I express all 
my gratitude, provided me with the available image. Then, in order to have the best image possible 
to check the text, I purchased high-quality images of the recto and the verso of the Rylands papyrus 
(format: large TIFF 600 dpi) from the Centre for Heritage Imaging and Collection Care (CHICC) of the 
University of Manchester Library. I would like to take the opportunity to thank John Hodgson, Manu- 
script and Archives Manager, and Tony Richards, Heritage Photographer, for the excellent digital im- 
ages that now I own. 
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vertical trait slightly sloping to right. The shape of these traces can be convincingly 
compared with the sequence ya recurring several times in the papyrus. ? Accordingly, 
also taking into account the ending 6piov in 1. 71, a likely supplement that seems pale- 
ographically compatible and fits well the space is tò yà[p ka0é]|&pov. Additionally, 
on the syntactic side, the presence of yà[p would acquire an explanatory function per- 
fectly appropriate to the content, thus introducing why the author - that in the previ- 
ous sentence affirms his preference for the lying-down position — rejects the 
Kxaðéðpıov ocyfipa, being it GuooAyéotatov, “more painful”. This sense of opposition 
is then marked and strengthened by the adversative 6£ (1. 72), where it is stated that 
the lying down position is safer. 

To conclude on this aspect, the example of kaBéSptog is useful to highlight the 
potential philological contribution of the vertically in-depth and both ‘atomic’ and 
‘holistic’ lexical studies carried out for Medicalia Online. Thus, given the high degree 
of textual awareness offered by this kind of lexical studies, the present specimen il- 
lustrates how the medical text can be improved in terms of new and more reliable (or 
at least plausible) readings and emendations, as well as it can be deeper and better 
understood: 


oi] | ušv GAAOL ko0é[6piov HEi]|woav 1óv káp[vovra] oxN||HaTIGetv. ñugt[ç 8$ pA ov] | KeKAtpévol] .] 
`v” TÒ yalp KaG]|Sptov oxñuú Eotwv 6u[oaA]|yeotatov, 16 Bè xexA[uié]|vov dopadéotepov. 


The other physicians think that it is better to put the patient in the sitting position, but we rather 
that it is better to lay him/her down: the sitting position is indeed extremely painful, whereas 
the lying down position is safer. 


3 Conclusions 


The experience of a specialized lexicographical tool like Medicalia Online demon- 
strates the different kinds and levels of usefulness and the potential of a digital lexi- 
cographical database characterized by a wide-ranging approach in the era of Digital 
Humanities. Among these, the usefulness of a tool specifically conceived to carry out 
in-depth, ‘vertical’ lexical studies aimed at providing the broadest possible overview 
of the examined items under multiple perspectives, namely linguistic, archaeologi- 
cal, historical-scientific. The usefulness concealed in the inner nature of a digital in- 
strument, that is its dynamism, its flexibility, its capability of allowing the author of 
the entry to constantly update the contents. The usefulness of the interdisciplinary, 


48 Cf. recto col. 11.5 yap, 1. 15 avaAoya, ll. 33 and 50 yap; verso col. I ll. 106 and 109 yap, as well as 
1. 110 ópyavtxoic, and finally col. II 1. 124 totyapobv and 1. 127 épya. 
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comprehensive approach to the Ancient World, which makes Medicalia Online a help- 
ful resource for contributing to improve and reconsider the studies in medical papy- 
rology as well as in the other subject areas involved, and broadens the circle of users 
to any scholar or enthusiast of those research fields. Furthermore, the user-friendli- 
ness of the entries in spite of the wide range of information and, most importantly, 
the mutual enrichment resulted from the interconnection between the MedOn lem- 
mas and the digital editions of the medical texts on papyrus in the core database. This 
strongly encourages an enhanced textual understanding stimulating a double aware- 
ness — a concrete one and a textual-philological one -, as the selected specimina have 
illustrated. The outcomes are, on the one hand, the attempt to reconstruct the physi- 
cal reality hidden under the words and, on the other hand, a better and deeper exe- 
gesis of the ancient medical texts through a better and deeper knowledge of the words 
themselves. 

Such a ‘vertical’ lexicographical approach also helps elucidating the degree of 
technicality of the medical terms, as well as of the texts containing them. Two of the 
presented case studies can be taken as illuminating examples of this aspect. ka0£6pr- 
oç (8 2.3) has a marked technical value, since it occurs, with its specialized medical 
meaning, only in strictly medical sources; conversely, the compound qoppaxo8rkn 
(8 2.2), characterized by a pronounced morphological transparency, did not devel- 
oped into a genuine technical term and was probably employed as a colloquial noun 
for an object of medical use, the physician's tool-case or its household counterpart, 
so that it was not a medical terminus technicus but just a word with a medical mean- 
ing. And indeed it does not appear in a stricto sensu medical papyrus, but in a private 
letter carrying medical information, P.Oslo II 54. Horeion, the author of the letter, who 
lives away from home for personal or professional reasons, asks his father Apollonios 
to send him the portable medicine-chest (ll. 5-6 néjupov | por tiv qoppakoOrkny), as 
well as two remedies with different properties, that is a biting pharmakon and a much 
milder one (ll. 7-9 aitrjoag n[ap]à tod iatpod | papparxov Saxvnpov| xai £repov 
15Utepov). The terms employed by Horeion point to different degrees of technicality 
and particularly significant is the juxtaposition between the pappakov 8akvnpóv and 
the rj8ovepov one, likely representing two kollyria of contrasting virtues, perhaps be- 
longing to the typology of the acharista.? These terms, indeed, especially the former, 
are technical adjectives applied to eyesalves and ophthalmic preparations. Thus, 
whilst the vocabulary employed by the writer reflects different levels of technicality, 
the choice of accurate and rare words reveals his proficiency in the medical field and 
a certain medical literacy.” This might mean that Horeion was either a literate layman 
interested in medicine, or even a person with healing skills, such as a would-be phy- 
sician or a pharmakopoles. The case of P.Oslo II 54 is useful to demonstrate how a 


49 Cf. BONATI 2016b, 667-9. 
50 On the topic, cf. HANSON 2010. 
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thorough lexical study of medical papyri from Greco-Roman Egypt contributes to 
shed new light on the socio-linguistic context in which these papyri were inserted, 
and may provide invaluable glimpses of the individuals “behind their words'. 

Finally, all these factors restate again the usefulness of Medicalia Online and its 
methodolosy to plumb the depths of the past, allowing for the revaluation of the en- 
tire material come to the present and promoting a more integrated knowledge of the 
Ancient World, to which a voice - if not ‘living’ at least not ‘dead’, or irremediably 
lost -can be returned: the echo of its written and material soul. 
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monograms 25-6 

Mons Claudianus 109, 116-7, 127, 131 

Morpheus 108-9, 111, 117, 140-1, 145-6 
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Olson, S.D. 89 

ontology model 5-6, 10, 48, 51, 52, 61 
open access 83,91 

Orestes 91-3 

Oribasius 22, 35, 37-8, 43, 47, 164-5 


orthography 89-90, 93, 109-10, 115, 119, 
122-3, 125, 128, 130, 132-3, 135, 137 
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paratextuality 30, 32-3, 38-9, 40, 48, 51-2 
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123-5, 128-30, 134, 143, 155 

phraseology 42, 137 

Pierazzo, E. 45 
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Vierros, M. vi, 7, 10, 14-15, 20-2, 105, 140-1, 
157-8 

Vigilanti,R. 139 

Vindolanda 21, 23 


virtuality 8, 46-7, 53, 89, 159 
vocabulary 23, 145-6, 150, 153, 156-7, 160, 
162, 170 


West, M.L. 87 

Wilcken, U. 131 

Withington, E.T. 168 

writing 5,7, 11—12, 18, 20-1, 24, 30, 35, 37, 
43-4, 47, 55, 57, 60-1, 98, 105, 107, 112, 
143, 151-2, 157 

Würzburg 20, 22, 47, 64, 68, 71, 140 
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P.Grenf. Il 15 
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P.Mich. XVII 758H r 11 

P.Mich. XVII 758 H v 2, 3 

P.Naglun Il 22 

P.Oslo Il 54 

P.Oslo Il 54,6 

P.Oslo III 72,9 

P.Oslo inv. 1576 

P.Oslo inv. 1576,5 

P.Oxford Sackler s.n. 

P.Oxy. | 

P.Oxy. 1 40 

P.Oxy. 1 135 


38 
43 
124 
75 
76 
119 
141 
127 
127 
8, 133 
69 


69, 71 


69 


124 
35 

29, 33, 36 
119 
126 
126 
126 
126 
126 
128 
22,37 
47 

95 

16 

39 

99 

47 
47-50, 52 
51 

25 

25 
123 
170 
161 
32 

28, 38 
28 

33 
119, 127 
11 

126 


P.Oxy. 1 136 

P.Oxy. 1 137 

P.Oxy. 1 138 

P.Oxy. 1 144,4 

P.Oxy. 1 155,4 

P.Oxy. Il 234 + LII 3654,92 
P.Oxy. VI 963 

P.Oxy. VIII 1088 

P.Oxy. IX 1184 

P.Oxy. IX 1184v 

P.Oxy. XIX 2221r 
P.Oxy. XIX 2238,18, 21 
P.Oxy. XXXIII 2665,17, 19 
P.Oxy. XXXIV 2724,20 
P.Oxy. XLIII 3117 
P.Oxy. XLIII 3117,13-14 
P.Oxy. LIX 4001 

P.Oxy. LIX 4001,30-1 
P.Oxy. LVI 3851 

P.Oxy. LXI 4131,16, 39 
P.Oxy. LXIV 4441,x,27 
P.Oxy. LXX 4790,16, 19, 30 
P.Oxy. LXXIV 4972 
P.Oxy. LXXIV 4973 
P.Oxy. LXXX 5239 
P.Oxy. LXXX 5239,15 
P.Oxy. XVI 1898,19, 38 
P.Petr. 112 

P.Petra | 4,5 

P.Rein. Il 20 
P.Ross.Georg. | 20 
P.Ross.Georg. V 34,2 


P.Ryl. III 478 
P.Ryl. II 529 
P.Ryl. III 529r 
P.Ryl. IIl 5291,57 


P.Sorb. inv. 2302 

P.Sorb. inv. 2303 

P.Stras. VIII 772 

P.Stras. VIII 772,6, 9, 15, 21 
P.Strasb. inv. 849 
P.Strasb. inv. 1187 
P.Strasb. inv. 1187, A, i,11 
P.Strasb. inv. 1187, A, i,14 
P.Strasb. inv. 1654 


P.Strasb. Copte inv. 563,6-7 


P.Tebt. Il 272v 
P.Tebt. Il 272v,4-5 
P.Tebt. Il 318,23 


126 
126 
126 
126 
159 
16 
163 
22 
16, 18 
42 
29, 33, 36 
128 
127 
124 
126 
126 
160 
161 
44 
128 
133 
126 
164 
16 
12, 28,38 
28 
128 
119 
133 
99 


22, 24,37 
24, 44 
24, 47 

99 

161 

43 

44 

127 


P.Tebt. Il 679 
P.Tebt. III.1771 
P.Tebt.Tait 39 
P.Tebt.Tait 40 
P.Tebt.Tait 41 
P.Turner 14 
P.Yale Il 134 

PSI inv. 3783 

PSI inv. 22011,48 
PSI IV 428,89-90, 92 
PSI V 454,19 

PSI VI 718 

PSI X 1180a iii,12 
PSI XV 1510 
SBI4668,4 

SB VI 9625,23 
SB X 10559,1 


III. Literary sources 


Aesch. Th. 454-5 
Aet. VI 50,75-9 
Aet. XV 5,50-2 
Aet. XV 5,50-1 
Ammon. Diff. 390,2 


Cat.Cod.Astr. 1104,26-30 


Dsc. MM III 147 
Etym.Magn. 520,26-7 
Eur. Or. 1-25, 401-25 
Gal. Antid. | 5 

Gal. Comp.med.gen. 
Gal. Comp.med.gen. 13 
Gal. Comp.med.gen. | 4 
Gal. Comp.med.loc. |1 
Gal. Comp.med.loc. | 2 
Gal. Comp.med.loc. Il 3 
Gal. In Hp. Art. 122 

Gal. In Hp. Epid. ll 4 

Gal. In Hp. Epid. 11 8 

Gal. In Hp. Off. | 6 

Gal. In Hp. Off. Ml 22 
Gal. Indol. 33-5 

Gal. Libr.pr. || 91-3 
Heliod. Cheirurg. 
Herod.Med. ap. Orib. 
Hom. Il. 1 5 

Hom. Il. VI 287-8 


165 
40 
164 
167 
161 
161 
29 
163 
92 
39 
41 
51 
52 
39 
40, 160 
40 
168 
41 
11 
164 
11 
39 
39 
38 
43 
96 
96 


III. Literary sources — 185 


SB XIV 1629 

SB XVIII 13732,13 

SB XVIII 13947,15 

SB XX 14990,15 

SB XXIV 15915,6 

SB XXIV 16290 

SB XXIV 16290,2 

SB XXIV 16291,4 

SB XXVI 16458 

SB XXVI 16458,10-13 
SB XXVI 16796,10-11, 16 
Sel.Pap. 1 85 
Stud.Pal. III 314,1 
Stud.Pal. VIII 791,1 
Stud.Pal. VIII 875,2 
UPZ159 

UPZI7 


Hom. Il. VIII 433-47 

Hom. Il. VIII 526 

Hom. Il. X 

Hom. Il. X 10 

Hom. Il. XVII 578 

Hp. Aph. 

Hp. Art. 7,36-43 

Hp. Epid. IIl 16 

Hp. Fract. 37 

Hp. Off. 2 

Nic. Ther. 333-4 

Nic. Ther. 387 

Nic. Ther. 394 

Orib. Coll.med. 

Orib. Coll.med. IV 615,19 
Orib. Coll.med. IV 620 
Orib. Coll.med. V 30,6-7 
Orib. Coll.med. VI 23,9,4-10,1 
Orib. Coll.med. IX 14,6,1-3 
Orib. Coll.med. XLVI 11,2,1-3,3 
Orib. Syn. Eust. Ill 28,6, 9 
Orib. Syn.Eust. VII 45 
Paul.Aeg. IIl 2,2,4—6 
Paul.Aeg. III 79,1 
Paul.Aeg. VI 8,1,10-11 
Paul.Aeg. VI 59,1,9 
Paul.Aeg. VI 90,4,10-11 


168 


164 


186 — Indices 


Paul.Aeg. VI 99,2,1-6 
Paul.Aeg. VI 101,1,3-6 
Paul.Aeg. VII 17,31 
Philod. Mus. IV 
Philod. Piet. 

Philod. Poem. 1 
Philod. Rhet. 2 

Plat. Meno 95e 

Poll. On. IV 203 
Procl. Or. 18 

Ps.Gal. Def. med. 
Ps.Gal. Def. med. 424 
Ps.Gal. Introd. XII 11 


IV. Greek concordance 


164 Ps.Hp. Ep. 3, 6, 6a 42 
164 Ps.Hp. Ep. 4, 4a, 5 42-3 
52 Ps.Hp. Vet.med. 2,17-8 149 
70 Ps.Sor. Quaest.med. 38 
69 Sch. Aesch. Th. 454h 165 
69 Sch. Eur. Or. 1-500 91 
69, 71 Sch. Eur. Or. 1 93 
66 Sor. Gyn. 1 27,3,4 167 
28 Sor. Gyn. Il 37,5,1-4 165 
161 Steph. In Hp. Progn. Il 1 28 
38 Suda 6 896,1-3 165 
28 Thgn. 434-8 66 
28 Zonar. Lex. 6 524,27 165 


N.B. Single Greek letters are not indexed. 


åykáñaç 165 
dykiotpeior 166 
ddeApe 109 
adEAQds 160 
A8avdoioc 79 
A0Úp 27 

A0úç 27 

Ja 115 

aikaBa 116 
aitmoag 170 
aitiat 43 
aiwpntéov 166 
Ade&avoptov 167-8 
&Ma 160 

GAAOL 167-9 
åpetakivntov 145 
üpmeAuóv 124 
dugpioRava 29 
augiopatva 29 
-av 125 

avti 160 
avadoya 168 
üvdppootoc 165 
åváponov 165 
üváppomov 164-5 
åvápponoç 165 
à&i00 167 
áóNccay 96 


anoddow 123 


ånoðwow 123 
GmAR 124 

and 116 
anddog 160 
amokpeivaodar 126 
ånupétwv 166 
dpaxog 27 
dppeva 164 
Aproñoioç 116 
dptoug 109 
dptwv 109 
åoðevéwv 164 
åonaoópeða 119 
domdZopat 119 
domdoacbat 119 
dondoeoda1 119 
QopoÀ)éotgpov 167-9 
aùtå 109 

auto 165 
aùtoù 27,160 
Qut) 27 
äpwvoç 31 


BeAbvnv 166 

BiBAcia 126 
BiBAeidiov 119 
BiBAidiov 119 
BiBAtopuAdK(e)tov 131 


BiBAtopuAdKEelov 126-7 


BiBAtopuAaketov 127 
BiBAtopuAdKiov 126-7 
Biov 167 

BAÉQapa 166 
Bonenpotwv 165 
Bopeito 29 

Boteitat 29 


yaAaKtog 165 
yap 168-9 
yeivecbar 8 
yeivopat 8, 133 
yeivook, 133 
yelwoKiv 133 
yevéodar 8 


yiyveodar 8 
yiyvetat 25 
yiyvopot 133 


yiyvwpevor 133 
ylyvwokelv 133 
yyvooko 133 
yivopat 8, 133 
ywóyeva 133 
ywookew 133 
ywookw 133 
yvOvat 160 
ypapeion 124 
yuvoukóç 116 


daŭra 96 
dakvnpdov 170 
dé 160, 165-9 
del 44 
deAtapiov 161 
diekpavoùpev 166 
didyew 167 
diaoteiAavtes 166 
üi£Optov 165 
õıpu 166 

ëóEn 127 

Spa@v 164 
duooAyéotatov 167-9 


édav 109, 127 
£yKNivov 165 
éyxelpidtov 161 
€d0g 168 

Edpa 163, 165 
EIOMA 165 
EIMMIOv 165 
EdwAiwv 165 
ÉCoyot 163 
Je 109, 125-7 
e 31 

eipnke 116 
eikoot 126 
cipi 166 

civa 127 

cig 123, 160, 166 
gur 24 
gloomtpov 27-8 
k- 130 

ÈK 166 
Ekataotig 124 
Èkatootaîg 124 
£kotootfjc 124 
É Napa 116 
edaBa 117 
EhaBov 116 
£A0riv 126 
ÉA0n 109 
£A0iv 126 
EAuTpoElons 28 
£AutporiboÜc 28 
ÉpAootog 51 
ÉymAaotog 51 
ÉymAaotpog 51 
£v 29,165 


éouuia 165 
Enei 109, 126 
Eni 109, 126 


émtkadoupevns 143 


me 


miotoAnv 116 
£munóiov 165 


Pi 


Epya 168 
Eppev(e)ia 7 
Epubpoeidoic 28 
Eputpoeidi 28 
ëç 164 
Eoomtpov 27-8 


éotw 165, 167-9 


Eoppaylopévat 160 


Éoyope 160 
oyo 109 
Eoyw 109 
Étepov 170 
Éyew 124 
Éye 126 
Éxig 126 


Geuyn 109 
Ontfjoot 160 
Çuúpvn 27 
Guyos 165 


ü 43 

noutepov 170 
noutepov 170 
üluseig 167-9 
nuepa 143 
üludv 160 
nEiwoav 167,169 


OdAapoy 96 
OEddwpos 160 
O£cooÀKÓóv 168 
0eò 79 

Okar 162 
erkn 162 
Onuépa 143 


iotpeiov 160-1 
iatpıkóç 161 
iatpot 161,170 
intpeiov 164 
ikoot 126 

Wa 125 


IV. Greek concordance — 187 


“log 163 
imme(ov 166 
ig 123,127,131 


Ka’ 165 
Ko8éópa 163,165 
Kkadedpdpiov 163 
kaðéðpeiov 166 
Kabedplov 165-8 
KkaBédpiog 163-9 
Kkagedpiou 165 
Kabedpiw 165-6 
KkaBedpiwv 165 


Kai 40,109, 124, 160, 165-6, 


170 
KOAOV 31 
KaAotot 40 
KoAds 109 
Kdpvovta 165-9 
Kåuvwv 164, 66 
kat 164 
Kata 96,163 
Kotayeivopat 8 
Kotayiyvopat 8 
KOTOAKEKAILEVOV 167 
KOTOAKEKAIWEVOG 167 
Katapomov 165 
Kotóppomov 164-5 
kotEBrcoto 96 
KÉ 124 
KEKAWEVOV 167-9 
KEMaAnv 165 
KiBaptdatns 116 
KAivy 165 
KAwvoka8£ópiov 163 
KAvitrp 163 
KOWNIZETW 165 
Kowwviv 126 
Koitng 165 
KoAakelag 144 
KOAAOUpIa 161 
KoAAoupiwv 160 
Kómou 165 
Kü0pa 27 
Kupiwg 165 
Aoddv 31 
Aopfávo 116 
Eywpev 165 
Ae(pa 123 


188 — Indices 


MBa 123 
Mvov 166 
Atotwv 44 
Mwtww 44 
Apa 25 


uov 167, 169 
Mopovot 115 
Mapovatog 115 
pé 167 

péya 168 

év 167-9 
pepidav 124 
pet 25, 165-6 
petaoyeiv 165 
un 31 

piov 124 
puxpov 165 
pitou 43 

Hot 8,109, 116, 170 
uòvng 160 

pou 8,116 


vápOnE 161 

vavdarg 124 

vaüAoig 124 

Nexoutou 115 
Nexoutwt 115 

vnos 165 

voookopEiov 127-8, 131 
vocokopiov 128 

vuki( 109 


Eepov 25 


öykoç 36 
öðev 160 

oi 164, 167-9 
oiag 165 
olvov 31 

ökou 164 
OKTw 125 
ökwç 164 
OAKOG 36 
ópoAoyO 124 
Ovopa 27 
Ovta 166 
ofuyyeiou 160 
OEUyyiov 161 


ó&uyy(ou 160 
ópyava 164 
opyavikoîg 168 
doa 164 
óoumtpov 27-8 
ótav 109, 165 
oté 167 

óu 116 
oùykia 25 
oÙKk 109,165 
òpoaApoù 166 


mode 126 
náv 66,126 
návta 160 
Tapa 170 
Trapayeivetat 8 
Trapayeivopat 8 
Trapayiyvetat 8 
Trapayiyvopat 
Napadeioou 123 
Mapadioou 123 
Topéoxgv 160 
napdevixov 165 
mácyovta 166-7 
mnóoyov 164, 166-7 
néAe 36 

TépWas 109 
nepupag 111 
méupov 170 
népuyw 109 
ménepi 31 

mepei 126 

nepi 126, 160, 165 
TIEPLYELVOHEVWV 8 
mepiyiyvopévuv 8 
nüxgog 28 

npa 161 

rñpov 161 
mnüxeoc 28 

TAEEL 36 

Toño 31 
noioe 31 
nomoeg 109 
nontik 165 
Toloupevwv 165 
TOAAOUG 67 
mopeia 109 
Topra 109 


foe) 


Topne 109 
mpeopiotatov 44 
mpeoplotatov 44 
npnvij 167 
mpodiakxatexouoa 165 
mpoküntov 165 
npóc 25 
Tpoopopüc 43 
miepuyetov 166 
miepuyeiou 166 
nunoig 109 
TWAKOV 165 


pa 96 
-p 163 
péáupotog 43 


canpóv 31 

oe 119 
oc8uvoyAoupetep 79 
owap@v 47 
opertiot@v 44 
OPEULWTOV 44 
OMI OTOV 44 
ouùpvn 27 

oot 109,119,124,127 
oou 127 
onouónodto 160 
coynpaticavteg 47 
Otpatteds 129 
otpatnyos 129 

ou 109 

cuppétpou 165 
ov 79,124 
opodpa 165 
oyfijpa 164-5, 167-9 
oxńpatı 165-6 
oxnpatiZev 165, 167-9 
oynpaucéo0no 167 
oxnpotiZiv 165 
oxnpotigovta 166 
oynuotíGo 164 
oxnpatixov 165 
oynpatícavteg 47, 166 
oxnpatwv 165 
OyiotoÜ 25 
OXoAaoTIKOG 79 
o@pya 164 


tà 133, 160, 164 UypoKnAn 28 QAoióv 29 
Taig 165 Uda 29 QAoióg 29 
Tauty 109 Udpeiav 160 QAoüv 29 
Tayelov 119 Udpeiag 160 QAoüc 29 
Tdaxlov 119 Udpia 159-61 opeitat 29 
te 31 Udpiat 160 pics 164 

tf 109, 127, 143 Udpiav 160 

tjv 143,161,170 Udpiag 160 xyoABdvn 25 

tfjg 43, 116, 143, 160, 165 Udpiokn 160 xyoAkdvOou 25 
ti 160 UdpoKnAn 28 xelpoupyeia 166 
Tó 160, 164-9 Hdwp 29, 159 XElpoypapeion 124 
totyapoüv 168 Upétepog 79 XElpoypagia 124 
toic 28,165 unap 27 xelpoupyinv 164 
tolol 28 Undpyw 8 Xu£0Aag 52 
tóv 96,125, 165-9 úneðeičapev 165 XipétAag 52 
tomog 164 ÙUmép 27 Xpewoteiv 124 
Top 96 Uneppule)la 131 xpovos 164 

toU 160-1, 166, 170 UTeppueta 126 xpò 25,29 
Tous 166 Umep@uia 126 xUtpa 27 

tpia 109 Unnpétar 164 xopíov 124 
Tpixa 166 nuov 167 xwpig 160 
tpómoc 164 

TUXNt 127 gappaxofkar 162 @KTW 125 

tà 165 qoppako0rkn 161-2, 170 Wvopa 27 

tüv 25, 43, 47, 125, 165 Qoppoko8rknv 170 Gonep 165 


V. XML tags 


<ab> 18 
<abbr> 24-6 
<add> 42-3 


Qóppokov 162, 170 


<del rend="cross-strokes"> 44 


V.XML tags — 189 


<del rend="erasure"> 44 
<del rend="slashes"> 44 


<add place="above"> 42 
<add place="below"> 42 
<add place="inline"> 44 


<add place="interlinear"> 42 


<add place="left"> 42 
<add place="right"> 42 
<am> 25-6 

<app> 37,45 

<app type="variant"> 37 
<bibl> 144 

«choice» 26-7, 31-2, 143 
«corr» 26, 31,121 

<d n="xx" v="xx"> 145 
<del> 44 

<del rend="corrected"> 44 


<desc> 19,27 

<div type="textpart"> 18 
<e> 145 

«ex» 24-6, 31 

<expan> 24-6, 31 

<f> 145 

<figDesc> 35 

<figure> 35 

<foreing xml:lang=""> 27 
<g> 30,36 

<g type="check"/> 31 

<g type="dipunct"/> 30 


<g type="horizontal-stroke"/> 25 


<g type="middot"/> 30 
<g type="mid-punct/"> 142 


190 — Indices 


<Q 
<Q 
«g 
«g 
«g 


type- 
type- 
type- 
type- 
type- 


"monogram"» 25-6 
"sinusoid"/> 25 
"slanting-stroke"/» 25 
"tripunct"/> 30 
"vertical-stroke"/> 25 


«gap» 19, 32, 46 
<gap reason="ellipsis"> 19,27 


<gap reason="illegible"> 19 
«gap reason="lost"> 19,32, 46 


hi 
hi 
hi 
hi 
hi 
hi 
hi 


hi 


hi 


hi 


< 
< 
< 
< 
< 
< 
< 
< 
< 
< 
< 
<hi 
< 
< 
<hi 
< 
< 
< 
< 
< 
< 
< 
< 


DD Do p o 


rena 
rena 
rena 
rena 
rena 
rena 
rena 
rena 
rena 
rena 
rena 


rena 


=" 


| r— XX 


n="x 
n="x 
n="x 


n="x 
n="x 


n="x 


handShift> 112 
handShift new="m2"/> 20 
hi» 20,31,34-5 


-"acute"/» 35 
-"asper"/» 35 
-"circumflex"/» 35 
="diaeresis"/> 35 
="eisthesis"/> 34 
="ekthesis"/> 34 
="lenis"/> 35 
="subscript"> 20 
="superscript"> 20,25 
="supraline"> 20 
="supraline-underline"> 20 
="tall"> 20 

"> 145 


1b/» 18, 20, 144 


x"> 31,144 

x,md"/> 41 
x,minf"/> 41 
x,ms"/> 41 
x,msup"/> 41 

x" break="no"> 31-2 


<lb n="xx" rend="indent"/> 33 

<lb n="xx" rend="outdent"/> 33 

«lem» 31, 37,47 

<milestone rend="box" unit="unde- 

fined"/> 20 

<milestone rend="coronis" 

unit="undefined"/> 31 

<milestone rend="diple" unit="un- 

defined"/> 31 

<milestone rend="diple-obelismene" 

unit="undefined"/> 31 

<milestone rend="paragraphos" 
unit="undefined"/> 30-2 

<note> 17,143-4 

«num value="xx"> 26, 32 

«orig» 26-7, 32,122,143 

<p> 145 

<q> 36-7 

«rdg» 31, 37,47, 89 

«reg» 26-7, 32,122, 143 

<sic> 26, 31,121 

<space> 32 

<subst> 44 

<supplied> 46, 143 

<supplied reason="lost"> 31-2, 46 
143 

<supplied reason="omitted"> 31,46 


<supplied reason="parallel"> 46 
<surplus> 46 
<title/> 141 
<unclear> 19 


